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Preface 


While linguistic theory is in continual flux as progress is made in our ability to under- 
stand the structure and function of language, one constant has always been the central 
role of the word. On Looking into Words is a wide-ranging volume spanning current 
research into word-based morphology, morphosyntax, the phonology-morphology in- 
terface, and related areas of theoretical and empirical linguistics. The 26 papers that con- 
stitute this volume extend morphological and grammatical theory to signed as well as 
spoken language, to diachronic as well as synchronic evidence, and to birdsong as well 
as human language. Central concerns of the volume’s authors include morphological 
gaps, learnability, increases and declines in productivity, and the interaction of different 
components of the grammar. 

The contributions in the volume explore linked synchronic and diachronic topics in 
phonology, morphology, and syntax (in particular, inflection and cliticization) and their 
implications for linguistic theory, and are informed by data from a wide range of lan- 
guage families. Languages discussed in the contributions include Ancient and Modern 
Greek (Hale, Kavitskaya), Macedonian (Kaisse), Sanskrit, Finnish, and Sakha (Kiparsky), 
Middle Indo-Aryan (Deo), Rumantsch (Maiden), Latin and French (Timberlake), Russian 
(Spencer), Icelandic and Faroese (Ihráinsson), Welsh (Hammond), Hebrew (Bat-El, Hor- 
vath & Siloni), Limbu and Southern Sotho (Stump), Lusoga (Hyman & Inkelas), Yidiny 
(Round), Japanese (de Chene), and Tsotsil Mayan (Aissen), as well as ASL and other 
signed languages (Lepic & Padden, Napoli), as well as English. 

The papers are offered as a tribute to Stephen R. Anderson, the Dorothy R. Diebold 
Professor of Linguistics at Yale, who is retiring at the end of the 2016-2017 academic 
year and who has long been a major figure in several of the fields under investigation 
in the volume: phonology and morphology (and the history of scholarship on those 
topics), the of morphology-syntax interface, the relation of theory to data, the evolution 
of human language and its relation to animal communication systems. The contributors 
- friends, colleagues, and former students of Professor Anderson - are all important 
contributors to linguistics in their own right. The central contributions that Anderson 
has made to so many areas of linguistics and cognitive science, drawing on synchronic 
and diachronic phenomena in diverse linguistic systems, are represented through the 
papers in the volume. 

The papers in Part I of the volume focus on phonology at the level of the word and 
below. Ellen Kaisse draws on evidence from literary Macedonian to investigate why 
stress assignment is word-bounded, despite the existence of higher-level prosodic inter- 
actions - that is, she discusses the reasons why lexical stress is specifically lexical. Dasha 
Kavitskaya revisits a robust generalization from work by de Chene and Anderson that 
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predicts compensatory lengthening should be structure preserving. She argues that a 
class of apparent exceptions to this generalization have independently motivated analy- 
ses consistent with the generalization. Juliette Blevins looks at the role of language con- 
tact in determining cross-linguistic strategies for the resolution of stop + liquid clusters 
through vowel epenthesis as a recurrent feature of loan-word phonology. Erich Round’s 
contribution addresses issues of learnability in determining the class of possible gram- 
mars. He discusses the implications of word-final deletion in Yidiny (Pama-Nyungan) for 
the theory of exceptionality, which he shows cannot be represented in purely phonolog- 
ical or purely diacritic terms. Like several papers in the collection, Hóski Thráinsson's 
examines the diachronic underpinnings of synchronic linguistic structure. He uses con- 
trastive evidence from alternations in Modern Icelandic and Modern Faroese to argue for 
different trajectories in historical change in u-umlaut and accounts for differing degrees 
of synchronic productivity and transparency of the umlaut rules in the two languages. 
The papers in Part II concentrate on issues in morphological theory, in several cases 
also drawing on the interaction of synchronic and diachronic factors. Martin Maiden 
argues for the use of comparative and historical evidence in deciding between possi- 
ble synchronic interpretations of phenomena. He revisits paradigmatic phenomena in 
Rumantsch and investigates whether allomorphy in verb stems is phonologically con- 
ditioned or is evidence for paradigm autonomy. Brent de Chene's paper looks at an 
apparent argument for a syntactic approach to derivational stem morphology (as pro- 
posed in the theory of Distributed Morphology) in languages like Japanese and shows 
that a closer examination of the facts supports the approach defended by Anderson on 
which it is just inflectional (and not derivational) processes that lend themselves to a syn- 
tactic analysis. Anderson's A-Morphous Morphology is an item-and-process approach 
that takes words to be related to each other by morphological processes; Semitic lan- 
guages, with their celebrated triconsonantal roots, were explicitly excluded from the 
purview of Anderson’s analysis. In her contribution, Outi Bat-El — invoking data from 
both adult and child language - argues that in fact the Anderson-style item-and-process 
framework can be naturally extended to provide an empirically grounded account of 
word relations in Modern Hebrew, with the word/stem taken as basic and the putative 
triconsonantal root seen as a mere residue of phonological elements. Mike Hammond 
applies data from English and Welsh to the examination and measurement of morpho- 
logical complexity, applying the statistical framework of Input Optimization developed 
by Hammond himself in earlier work to questions arising in morphological investigation. 
Larry Hyman and Sharon Inkelas (with Fred Jenga) address a different type of question 
for morphological theory, the nature of and motivation for multiple exponence. Lusoga, 
a Bantu language spoken in Uganda, exhibits multiple exponence of causative, applica- 
tive, subjunctive and perfective suffixes and the apparent intermingling of derivational 
and inflectional affixation processes, properties which Hyman & Inkelas see as posing 
challenges for theories of morphology that seek to minimize redundancy and treat deri- 
vation as distinct from (and invariably ordered before) inflection. Finally, Charles Yang's 
paper examines a different issue for morphological theory, that of morphological defec- 
tiveness arising when a productive process fails to be recognized by the language learner. 
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He reconsiders the “amn’t problem - the absence (in most dialects of English) of an in- 
flected negation for the first person singular copula - through the lens of his Tolerance 
Principle, a corollary of the Elsewhere Principle motivated by the behavior of speakers 
with respect to paradigm gaps in English, Russian, Spanish, and other languages. As 
in other papers in this section (and in the volume more generally), Yang extends his 
view to consider the implications of the processes he describes for language acquisition, 
variation, and change. 

Part III encompasses a set of issues relating morphology to syntax, drawing on com- 
plex synchronic and diachronic evidence from a diverse set of languages. Sandy Chung's 
paper returns to the interaction of causative marking and number agreement in Cha- 
morro, which has been recognized as a challenge for the traditional view that inflectional 
affixes invariably attach outside derivational ones. While Anderson proposed taking 
number agreement to be derivational, Chung argues that the apparent causative prefix 
is actually a prosodically deficient verb, thus dissolving the apparent counterexample to 
the inflection-outside-derivation while preserving the intuition that number agreement 
is inflectional. Another contribution to morphosyntactic investigation in this section is 
Randy Hendrick's argument for analyzing the English preposition of (appearing in de- 
gree inversion expressions like more complex of a theory) as a phrasal affix rather than 
a syntactic head, thus predicting certain distributional features of the construction and 
allowing an explanation of the appearance of of in fractional expressions (one third of a 
cup). 

Several of the papers in this section examine the nature of clitics. Mark Hale investi- 
gates the syntax underlying the distribution of clitics that fall under Wackernagel's Law, 
which describes the tendency of clitics to appear in second (or, now, "Wackernagel") 
position in the sentence. Utilizing diachronic data from Ancient Greek and Indo-Iranian, 
Hale provides a detailed examination of cases in which there are multiple demands on a 
Wackernagel position and of the devices for resolving those conflicts, ultimately taking 
Wackernagel’s Law to be epiphenomenal on other phenomena. Judith Aissen similarly 
considers exceptional clitic placement, but in Tsotsil (Mayan); while Hale argues for 
syntactic constraints on exceptionality based primarily on Attic Greek, Aissen focuses 
on the role of phonology, and in particular prosodic constraints, in the determination 
of the placement of clitics on the right periphery (rather than in the syntactically con- 
ditioned second position). Like Hale, Deo and Kiparsky examine diachronic shifts in 
Indo-Aryan, albeit toward different ends. Ashwini Deo’s article traces the morphosyn- 
tax of ergative alignment loss in Indo-Aryan. Anderson and others have While much 
work, including a seminal paper by Anderson, has sought to describe how languages 
develop ergative case marking, Deo turns the tables by seeking to develop an under- 
standing of how nominative-accusative marking can emerge in historically ergative sys- 
tems, as in the gradual de-ergativization of Middle Indo-Aryan based on independent 
shifts in argument realization in that language. Paul Kiparsky leverages a synchronic 
analysis of gerunds and nominalizations across a variety of languages (Vedic Sanskrit, 
Finnish, Sakha (Yakut), English) to argue for a lexicalist rather than syntactically derived 
analysis of agent nominalizations; gerunds, on the other hand, are argued to be verbal 
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at all levels of the syntactic derivation. Thus, we have a neatly symmetrical system: 
while gerunds are verbs that bear a Case feature (via their INFL head), transitive nomi- 
nalizations (pace Baker) are nouns with an Aspect feature. The final two papers in the 
(Morpho)syntax component of the volume bear on the status of inflectional morphology 
within the general theory of grammar. Like other authors in the volume, Andy Spencer 
takes as a jumping off point Anderson's “split morphology" hypothesis - the view that 
derivational morphology is lexically mediated while inflectional morphology is syntac- 
tically mediated. Along with the gerunds and nominalizations explored in Kiparsky's 
contribution, another test case for this view is posed by participles. Spencer investigates 
Russian participles and other transpositions utilizing a generalized version of Stump's 
Paradigm Function Morphology. He concludes that while the participle's actual lexical 
entry is that of an adjective (as lexicalist theory predicts), it realizes the verbal proper- 
ties of voice/aspect, sharing the semantic properties and lexemic index of its verbal base, 
just as in the case of verb inflection. In his paper, Greg Stump focuses on the viabil- 
ity of Anderson's conception of rule block for inflectional processes within a grammar 
and on rule interaction. His evidence is provided by multiple exponence in Limbu, a 
Tibeto-Burman language of Nepal, and by polyfunctionality in Southern Sotho, a Bantu 
language of southern Africa. He develops principles of rule conflation that are indepen- 
dently motivated and converge to explain the outcome when a single rule participates 
in more than one block. 

The volume concludes with a set of papers that explore linguistic theory from a broad 
perspective, while also extending the issues of words and rules from the structure and 
evolution of spoken language to the properties of signed languages and birdsong. Mark 
Aronoff, who is (along with Anderson) a key figure in the development of word-based 
morphology, turns his attention here to the mismatch between the nature of linguistic 
rules and that of biological evolution (despite the influence of the results of neogram- 
marians on Darwin and other prime movers in the establishment of the principles of 
natural selection). This mismatch led Saussure to abandon his earlier productive fo- 
cus on diachrony in favor of a single-minded characterization of synchronic linguistics. 
Aronoff borrows Gause's principle of competitive exclusion from evolutionary ecology 
as a means for reviving the evolutionary approaches of Saussure and his 19th century 
predecessors. Besides the dichotomy of diachrony and synchrony, Saussure is also cel- 
ebrated for his distinction between langue and parole, the latter notion often dismissed 
by generative linguists as a matter of peripheral interest. Alan Timberlake revisits Saus- 
sure's treatment of the shift in stress rules from Latin to French and points to the crucial 
role of parole in accounting for the relevant shifts not only in that case but in the al- 
ternations dictating the appearance of intrusive /r/ (the idear of it) and reanalysis (a 
napron » an apron) in English. The papers by Newmeyer and Horvath & Siloni recon- 
sider theoretical issues in generative grammar and its rivals. While parameters have 
long been adduced in grammatical theory as a means to account for cross-linguistic 
variation, Fritz Newmeyer's paper re-examines the role of parametric explanation and 
argues that despite its success in leading generative scholars to the discovery of cross- 
linguistic generalizations, macro- and micro-parameters rest on shaky ground empiri- 


cally and conceptually in current theoretical scholarship. Newmeyer goes on to survey 
some promising alternative approaches for capturing the relevant insights. Julia Hor- 
vath and Tal Siloni compare two theoretically distinct approaches to idioms. They argue 
that the distributional differences between phrasal and clausal idioms with respect to the 
diatheses with which they co-occur represent a serious problem for the approach of Con- 
struction Grammar, which treats linguistic knowledge as a network of stored construc- 
tions, while generative grammar, which allows for a computational system (providing 
unstored derivational outputs) as well as a storage module, is more capable of offering 
an empirically adequate account of these phenomena. 

The of iconicity in sign languages representations is explored in the papers by Lepic & 
Padden and Napoli. On the a-morphous theory of word structure proposed by Anderson, 
the word and not the morpheme is the fundamental building block of morphology. As 
earlier posited by Jackendoff and Aronoff, the task of Word Formation Rules is analysis 
rather than creation; the speaker’s knowledge encompasses the complex relationships 
among the words in her lexicon. Ryan Lepic and Carol Padden provide further support 
for this theory by drawing on the properties of lexical iconicity (and the gradual loss of 
iconicity) in American Sign Language. Donna Jo Napoli’s paper also deals with iconic- 
ity in non-spoken linguistic modalities, although she extends her domain from ASL to a 
wide range of the world’s signed languages. She uncovers biologically motivated “chains 
of iconicity” consisting of mappings from perceptual systems into visual realizations that 
evolve along semantic lines, a natural development that helps explains why unrelated 
signed languages exhibit similar signs for abstract concepts. The final contribution is 
by Louis Goldstein, a pioneer in the development of articulatory phonology, a gesture- 
based approach to the problem of how messages in the mind of the sender are transferred 
to the mind of the receiver through actions of the body. In his paper, Goldstein explores 
the ways such messages are conveyed by humans (through spoken or signed words) and 
by birds; his study demonstrates that despite their real and instructive differences, bird- 
song, like speech, is a complex motor behavior with an essential role played by temporal 
patterning. 

The word has always been at the crux of descriptive and theoretical research on phon- 
ology, morphology, syntax, and semantics. By looking into words, the contributors to 
this volume shed light on both specific subfields and the interfaces among them, on how 
languages work and how they change over time. The papers in this volume, in address- 
ing linguistic structure and linguistic relationships, help further our understanding of 
individual languages and of language as a unified phenomenon. 
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Part I 


Phonology 


Chapter 1 


Between natural and unnatural 
phonology: The case of cluster-splitting 
epenthesis 


Juliette Blevins 
The Graduate Center, CUNY 


A widely recognized feature of loan-word phonology is the resolution of clusters by vowel 
epenthesis. When a language lacking word-initial clusters borrows words from a language 
with initial #TRV sequences, T an oral stop and R a liquid, it is common to find vowel 
epenthesis, most typically vowel-copy, as in, for example: Basque <gurutze> ‘cross’ from 
Latin <cruce(m)>; Q’eqchi’ <kurus> ‘cross’ from Spanish <cruz> ‘cross’, or Fijian <kolosi> 
‘cross’ from English <cross>. The phonological rule or sound change responsible for this pat- 
tern is sometimes called “cluster-splitting epenthesis”: TRV; > &TV(jRV;. The most widely 
accepted explanation for this pattern is that vowel epenthesis between the oral stop and the 
following sonorant is due to the vowel-like nature of the TR transition, since #TRV; is per- 
ceptually similar to ST Vu RV: A fact not often appreciated, however, is that cluster-splitting 
epenthesis is extremely rare as a language-internal development. The central premise of 
this chapter is that #TRV; in a non-native language is heard or perceived as #T Vj) RV; when 
phonotactics of the native language demand TV transitions. Without this cognitive compo- 
nent, cluster-splitting epenthesis is rare and, as argued here, decidedly unnatural. 


1 Introduction 


Diachronic explanations have been offered for both natural and unnatural sound pat- 
terns in human spoken languages. Building on the Neogrammarian tradition, as well 
as the experimental research program of Ohala (e.g. 1971; 1974; 1993), it is argued that 
natural sound patterns, like final obstruent devoicing, nasal place assimilation, vowel 
harmony, consonant lenition, and many others, arise from regular sound changes with 
clear phonetic bases (Blevins 2004, 2006, 2008, 2015; Anderson 2016). Unnatural sound 
patterns, in contrast, cannot be explained in terms of simple phonetic processes. Case 
studies of unnatural patterns show more complex histories: in some cases more than one 
natural change has been collapsed; in others, a natural change is reanalyzed or inverted; 
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in still others, analogy has extended a sound pattern whose origins are morphological, 
not phonological; and combinations of all of these paths can also be observed (Bach 
& Harms 1972; Anderson 1981; Buckley 2000; Vaux 2002; Blevins 2008b,a,c; Garrett & 
Blevins 2009; Anderson 2016). 

However, typological study of regular sound change reveals certain kinds of sound 
change that are neither wholly natural nor wholly unnatural. For example, the shift 
of *#kl > st, documented in at least three different language families, appears to have 
a natural basis in perception, since [kl] and [tl] clusters are acoustically similar and 
confused with each other. However, the rarity of this sound change is associated with 
structural factors: misperception of [kl] as [tl] is strongly associated with the absence 
of phonological /tl/ clusters in a language. In this case, the absence of a sound pattern, 
/tl/, influences cognition, making listeners more likely to perceive [kl] as [tl] (Blevins & 
Grawunder 2009). The presence of a contrast can also facilitate particular types of sound 
change. As first argued by de Chene & Anderson (1979), compensatory lengthening 
sound changes are strongly associated with pre-existing vowel length contrasts. This 
statistical tendency is argued to arise from phonetically natural vowel lengthening in 
pre-sonorant and open syllable contexts (Kavitskaya 2002), combined with the cognitive 
effects of structural analogy, where pre-existing categories act as attractors in the course 
of language acquisition (Blevins 2004: 150-155 and Kavitskaya, this volume). 

In this contribution, I offer another example of regular sound change that is neither 
wholly natural nor wholly unnatural, and highlight the role of human cognition in cases 
where it has occurred. Cluster-splitting epenthesis is of interest, not only because of its 
rarity as a regular sound change, but in how it advances our understanding of sound 
patterns compared to 20th century models (Anderson 1985), and emphasizes the extent 
to which historical linguists, phonologists, and phoneticians still need the cognitive sci- 
entist (Anderson 2001). 


2 Cluster-splitting epenthesis in loanword phonology 


A widely recognized feature of loanword phonology is the resolution of clusters by vowel 
epenthesis. When a language lacking word-initial clusters borrows words from a lan- 
guage with initial #TRV sequences, T an oral stop and R a liquid, it is common to find 
vowel epenthesis, most typically vowel-copy, as illustrated in (1).! 

If loanword phonology is taken as evidence for properties of phonological grammars 
(Hyman 1970), a phonological rule describing this pattern can be stated as in (2). 


! For these and other examples, see: Blevins & Egurtzegi (2016) on Latin loans in Basque; Casparis (1997) 
on Sanskrit loans in Indonesian; Campbell (2013) on Colonial Spanish loans into Mayan languages; and 
Kenstowicz (2007) on Fijian loanword phonology. A reviewer notes that some varieties of Q'eqchi' permit 
initial /CR/ clusters. Proto-Mayan had only *CVC syllables, and Mayan kurus arguably reflects borrowing 
of Colonial Spanish cruz into a language which lacked initial CR clusters. 


1 Between natural and unnatural phonology: The case of cluster-splitting epenthesis 


(1) Cluster-splitting epenthesis in loanword phonology 


a. Source language Latin crucem ‘cross’ 
Target language Basque gurutze ‘cross’ 

b. Source language Sanskrit klēśa ‘defilement’ 
Target language Indonesian kelesa ‘indolent’ 

c. Source language Spanish cruz ‘cross’ 
Target language Q'eqchi’ kurus ‘cross’ 

d. Source language English cross ‘cross’ 
Target language Fijian kolósi ‘cross’ 


(2) Cluster-splitting vowel-epenthesis 
#TRV; = STV(G)RV; 


Within the structuralist and generative traditions detailed in Anderson (1985), the lo- 
cus of explanation for this type of epenthesis lies in phonotactic differences between the 
source language and the target language. Under this general account, the speaker of the 
target language hears a word pronounced in the source language, constructs a phono- 
logical representation with an initial #TR cluster based on this hearing, but then alters 
this phonological representation in line with the phonotactics of the speaker's native 
language which lacks initial #TR clusters (e.g. Broselow 1987; Itó 1989). 

Typological studies of loanword phonology and advances in our understanding of 
speech perception have given rise to 21st century treatments of these patterns that are 
more explanatory in accounting not only for a "repair" but for the specific type of sound 
pattern that results. At present, the most widely accepted explanation for the sound 
pattern in (2) combines two new findings in speech perception, one related to percep- 
tual similarity, and the other related to perceptual illusions. A first component of the 
analysis is that vowel-epenthesis between the oral stop and following sonorant is due to 
the vowel-like nature of the TR transition (Fleischhacker 2001; 2005; Kang 2011; Berent 
2013; Broselow 2015). Fleischhacker (2001; 2005) argues that the general pattern is de- 
termined by perceptual similarity: initial TR clusters are more perceptually similar to 
TVR than VTR. An important aspect of her work is the distinction between initial «TR 
clusters and initial #sT clusters, which rarely show vowel-splitting epenthesis, and defy 
purely phonotactic accounts. A second component of the analysis relates to specific 
structural differences between the source and target languages. Under the perceptual ac- 
count, perception of #TR by native speakers of languages that lack initial #TR is biased: 
these speakers will tend to hear a vowel between the oral stop and the following liquid, 
even if no vowel is present. Experimental work supporting the existence of illusionary 
vowels for Japanese speakers presented with CC clusters was presented in Dupoux et al. 
(1999), and has been supported by much subsequent work (see Peperkamp & Dupoux 
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2002; Kang 2003; 2011; Kabak & Idsardi 2007; Davidson & Shaw 2012), including a range 
of studies showing vowel percepts in TR clusters (Berent 2013, and works cited there).? 

Given this evidence, one might conclude that the sound pattern described in (2) is 
both natural and common, having a clear phonetic explanation (Blevins 2004; 2008b; 
2015). As a natural, phonetically-based sound pattern one might expect many instances 
of reconstructed word-initial "TR clusters to be resolved by a sound change parallel to 
the synchronic rule in (2). A sound change of this kind might be even more common 
than expected on phonetic grounds due to markedness proposals stating that complex 
onsets are less preferred than simple onsets (Prince & Smolensky 1993; Kager 1999). As I 
show below, these expectations are not borne out, suggesting that cognitive bias in the 
context of novel stimuli plays a central role in cluster-splitting epenthesis. 


3 Cluster-splitting epenthesis as regular sound change 


Very few well-studied and widely agreed upon proto-languages are reconstructed with 
initial “TR clusters at the oldest stages. One exception is Proto-Indo-European, recon- 
structed with "TR clusters as well as other initial cluster types (Fortson 2010: 64-65). 
Some widely agreed upon Proto-Indo-European reconstructions with initial "TR clus- 
ters are shown in (3). 


(3) Word-initial *TR in Proto-Indo-European 


a. "gras- ‘eat’ Cf. Vedic grásate ‘eats, feeds’, Greek grástis ‘green 
fodder, grass’, Latin gramen (< “gras-men) ‘grass, fod- 
der’ 


b. *preki- ‘ask’ ^ Cf Vedic prccháti ‘asks’, Latin precor ‘I entreat’, Ger- 
man fragen, Tocharian B prek- 


c."trejes ‘three’ Cf. Lycian tri-, Vedic tráyas, Greek treis, Avestan 
Orayo, Latin tres 


The Indo-European language family is relatively large, relatively diversified, and rela- 
tively well-studied in comparison with other language families of the world. According 
to Ethnologue, there are approximately 445 living Indo-European languages at present, 
and linguists agree that the major subgroups of Anatolian, Indo-Iranian, Greek, Italic, 
Celtic, Armenian, Tocharian, and Balto-Slavic have had long independent developments. 
If cluster-splitting vowel-epenthesis in (2) has a natural phonetic basis in perceptual sim- 
ilarity as outlined above, then a sound change like (3) might be expected to have occurred 
numerous times in the Indo-European language family. 


(4) Cluster-splitting vowel-epenthesis as sound change 
#TRV; > 4TV(j)RV; 


? This approach is not agreed upon by all researchers. See Uffmann (2007) and Hall (2011) for discussion. 
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However, cluster-splitting vowel epenthesis as a regular sound change is rare in the 
Indo-European language family. “TR clusters are inherited intact in all of the major 
subgroups, and sound changes affecting these clusters at later stages of development are 
of distinct types (e.g. palatalization of "lm Romance *Tl clusters; loss of *p in Celtic). 

Indeed, within the entire Indo-European language family, there appears to be only 
one clear instance of a regular sound change like (3)? The sound change in question 
appears to have occurred in relatively recent times, in the transition from Middle Per- 
sian to Modern Persian (aka New Persian, Farsi, Dari, Tajiki), or, perhaps more generally, 
from Middle to Early New Iranian.* While the specific sound change is rarely stated as 
in (3), it is implied. For example in their chapter on modern Persian and Tajik phonology, 
Windfuhr & Perry (2009: 427—428) describe the language as having syllable onsets con- 
sisting of only a single consonant, and note that "The inherited initial clusters have been 
resolved by prothetic or epenthetic vowels, either of which could become standardized, 
e.g. st-: star ‘star’ > setàre/sitora, br: bradar ‘brother’ > baradar/barodar..." (Windfuhr & 
Perry 2009: 428). The epenthesis process described is identical to that schematized in (3), 
and it is also characteristic in loanword phonology, on which much more has been writ- 
ten (see, e.g. Strain 1968, Karimi 1987). Illustrative examples comparing Middle Persian 
inherited clusters to Modern Persian #CVC sequences are shown in (5). 


(5) One case of cluster-splitting epenthesis in Indo-European: Modern Persian 


Middle Persian Modern Persian gloss PIE 

a. bradar baradar ‘brother’ *bhréh,ter- 

b. griftan gereftan, ‘grab, take’ ` "g'rebh;- 
giriftan 

c. draxt daraxt ‘tree’ *drew- ‘wood’ 

d. griy- geri- ‘to cry’ *g?reh;d- 


A second case of cluster-splitting epenthesis sound change is found in the Siouan- 
Catawba language family, a small group of languages in North America that includes 
Crow, Hidatsa, Mandan, Lakota, Dakota, Assiniboine, Yanktonai, Stoney, Sioux Valley, 
Chiwere (aka Iowa-Missouria-Otoe), Hoocak (aka Winnebago), Omaha-Ponca, Ponca, 
Kanza/Kaw, Osage, Quapaw, Biloxi, Ofo, Tutelo, Saponi, Catawba and Woccon. The 
diachronic process known as Dorsey's Law (Dorsey 1885) is a sound change taking Proto- 


3 Fortson (2010: 302-303) mentions evidence for a sound change similar to cluster-splitting epenthesis in 
Oscan, an extinct Italic language known from inscriptions from approximately 400 BC - 100 CE, as in 
aragetud ‘with money’ (cf. Lat. argento) and sakarater ‘it is consecrated’ (cf. Lat. sacratur). However, the 
“rg cluster continued in aragetud was arguably heterosyllabic RT (as opposed to tautosyllabic TR) and 
initial TR clusters are continued intact in Oscan as in trístaa, trííbüm, prüfatted (op cit.). 

^ [n his discussion of East and West Iranian dialectology, Windfuhr (2009: 21) states the reflexes of initial 
#CC-clusters as showing a distinct areal distribution: “insertion of a short vowel, CVC-, along the Zagros, 
including the NW tier I from Kurdish, Zazaki to the SW Fars and Larestan dialects, as opposed to initial 
vowel, VCC-, elsewhere", while in the east, Balochi and most East Iranian languages allow initial clusters. 
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Siouan *TRV to #TV;RV; in Hoocak (aka Winnebago)? Examples from Rankin et al. 
(2015) are shown in (3). 


(6) One case of cluster-splitting epenthesis in Siouan: Dorsey’s Law in Hoocak 


Chiwere Hoocak Proto-Mississippi-Valley gloss 
a.églufi ^ waki/kununi *krüri ‘forget’ 

b. glé keré “kre ‘go back to’ 
c.wa/brü  ru/purü ‘plough’  *prü ‘powder’ 


While the time-depth of Siouan-Catawba is thought to be 2,000-3,000 years (Parks & 
Rankin 2001), Hoocak and Chiwere are considered to be closely related and even some- 
times treated as dialects of a single language (Miner 1979).° Given this, Dorsey's Law 
must be a relatively recent development. 

Outside of the Persian and Hoocak cases, it is difficult to find convincing cases of 
cluster-splitting epenthesis as a diachronic development. And here lies the central point 
of interest. Given that cluster-splitting epenthesis is common in loanword phonology 
(2), and appears to be a natural phonetically-motivated process, why is it rarely attested 
as a regular sound change? Why, out of more than 440 Indo-European languages, is 
there only one clear case of a #TRV; > «TV(j))RV; sound change? And how should we 
understand the Siouan sister-languages Chiwere and Hoocak, where Chiwere continues 
#TRV, but Hoocak does not? 

Isuggest that cluster-splitting epenthesis is neither wholly natural nor wholly unnatu- 
ral: non-phonetic structural and cognitive factors are involved. The structural condition 
is that cluster-splitting epenthesis occurs only when speakers of a language that lacks 
initial TR clusters begin to acquire a language that has initial TR clusters. It is only under 
this circumstance that the perceptual illusion of #TRV as #TVRV arises (cf. Dupoux et al. 
1999), with this perceptual illusion constituting the cognitive catalyst for phonological 
change. An important component of this model is that regular sound changes of this 
kind will only occur under special types of language contact, where speakers dominant 
in a language that lacks initial consonant clusters suddenly (or without extensive expo- 
sure) acquire a language with #CR-clusters.’ If extensive exposure occurs, perceptual 
illusions of phantom vowels will weaken, lowering the probability of epenthesis as reg- 
ular sound change. Let us now evaluate this proposal with respect to the two cases of 
diachronic cluster-splitting epenthesis documented above. 


5 Dorsey's Law also refers to the resulting synchronic sound pattern in Hoocak. It also applies to medial 
clusters. Since the syllabification of medial TR is ambiguous cross-linguistically, discussion is limited here 
to initial TR where, at least utterance-initially, sequences constitute unambiguous complex onsets. 

é Miner (1979: 25) begins his article with the statement that: “Winnebago and Chiwere ... are, in spite of 
their geographical separation in historical times, very closely related and enjoy a high degree of mutual 
comprehensibility" He also notes on the same page (footnote 1) that “Winnebago-Chiwere is sometimes 
referred to in the literature simply as Chiwere? 

7 For a similar proposal regarding paragoge (final vowel insertion), see Ng (2015), a dissertation supervised 
by Steve Anderson. 


1 Between natural and unnatural phonology: The case of cluster-splitting epenthesis 


Modern Persian phonology has had significant influence from Arabic and Turkic. Ara- 
bic loans constitute about half of the lexicon, and some estimate that of the most frequent 
vocabulary, at least 25% is Arabic (Perry 2004; 2005). Turkic loans also exist and there is 
a long history of Persian-Turkic bilingualism as well as Turkic “Persianization”. Could 
acquisition of Persian by Arabic or Turkic speakers be the source of Modern Persian 
cluster-splitting epenthesis? I believe the answer is yes. More specifically, I suggest 
that the Persianization of Turkic people, such as the one occurring during the Ghaz- 
navid dynasty (977-1186), and extending over large parts of Iran, was a critical factor in 
the evolution of cluster-splitting epenthesis in Modern Persian. Turkic languages have 
phonotactics that appears to be most important in triggering cluster-splitting-epenthesis: 
they disallow complex onsets in word-initial position (and elsewhere). Under this sce- 
nario, Middle Persian underwent rapid phonological change, as it was acquired by na- 
tive speakers of Turkic languages across Iran. How early the process began is unknown, 
though it could have begun as early as the 10th century when Turkic speakers came to 
the area, or in the 11th and 12th centuries, when a large migration of Oghuz Turks re- 
sulted in the gradual “Turkification” of Azerbaijan and Anatolia (Frye 2004). Key (2012), 
who focuses on morphosyntactic effects of contact, suggests that Turkic influence may 
date from the Safavid state (1501-1736) “the rulers of which were Persianized Turks who 
spoke a variety of Middle Azerbaijanian that might actually have been a mixed language 
incorporating Ottoman elements (Stein 2005: 228).”8 Frye (2004) presents a distinct view 
of the Safavids as Turkicized Iranians, but most seem to agree that it was the post-Islamic 
migration of Turks, as opposed to Arabic speakers, that had the most linguistic influence 
in the area: “ ...the Turks who came, especially beginning from the tenth century, moved 
in sufficient numbers to change the linguistic map of the whole area. (op cit.)” 

Though Classical Arabic also disallows onset clusters, there are several reasons to 
doubt Arabic as the source of cluster-splitting epenthesis in Modern Persian. First, ev- 
idence from early loans into Classical Arabic shows common prothetic vowels, with 
epenthesis the exception (cf. Arabic 7iklil ‘crown , wreath’ from Syriac klilo, Arabic 
figlim ‘region’ from Greek klima; but also Arabic dirham ‘money’ from Greek drakhmi; 
Bueasa 2015). Second, the influence of Arabic on Middle Iranian languages came, primar- 
ily, through translation of religious texts into Arabic, and through acquisition of Arabic 
by writers and thinkers who used it as a prestige language. This socialization process 
was notably different from the Persianization of Turkic people referred to above, and 
resulted in significant loans, but no obvious evidence of Arabic influence on Persian 
grammar. 

I hypothesize that cluster-splitting epenthesis in the history of Persian arose as a result 
of contact between speakers of Turkic languages, which did not allow complex onsets, 
and speakers of Middle Iranian languages with initial #TR-clusters. As Turks became 
Persianized, they acquired Persian (and, perhaps, other Middle Iranian languages). In 
this process, cognitive effects of CV(C) syllable structure resulted in the perception of 
illusory vowels in #TR-initial words (cf. Dupoux et al. 1999), giving rise to the change in 


8 Key’s (2012) study of differential object marking in Turkic and Persian identifies Iranian Azerbaijan as an 
isogloss for this feature. 
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pronunciation schematized in (3). Under this account, the rarity of sound changes like 
(3) is attributed to three factors: first, initial #TR clusters are relatively stable over time, 
so (3) is unexpected as a language-internal development; second, a sound change of this 
kind requires contact between two distinct language types, one language which lacks 
complex onsets and another which has word-initial #TR; a third factor is the nature of 
the language contact involved, which must include social factors that demand rapid and 
full acquisition of the language with #TR clusters despite minimal previous exposure.’ 
Only when these last two conditions are met will cluster-splitting epenthesis occur as a 
regular sound change.” 

Can the same hypothesis involving language contact of a very specific type account 
for the evolution of Dorsey’s Law in Hoocak (Winnebago)? I believe so. Oral histories 
suggest that the split between Hoocak, traditionally spoken between Green Bay and 
Lake Winnebago in present-day Wisconsin, and Chiwere, once spoken south and west 
of Hoocak territory, occurred sometime in the mid-16th century, a time-line consistent 
with the great similarity between the two languages." This would make the mid-16th cen- 
tury the earliest time at which Hoocak could have developed cluster-splitting epenthesis, 
an innovation not found in Chiwere (3). By the time Jean Nicolet made contact with the 
“Ho-Chunk” in 1634, with an estimated population of 8,000 or more, their culture was 
very similar to that of surrounding Algonquian tribes, they were completely encircled by 
speakers of Algonquian languages, and the language had a significant number of borrow- 
ings from Central Algonquian languages (Radin 1990; Pfister 2009: 17).? I suggest that 
sometime between the mid-16th and mid-17th centuries, (pre-)Hoocak was acquired by 
speakers of neighboring Algonquian languages. Since none of the Central Algonquian 
languages had initial #TR clusters, cognitive effects of #CV(C) syllable structure resulted 
in the perception of illusory vowels in #TR-initial words (cf. Dupoux et al. 1999), giving 
rise to Dorsey's Law. As with the contact scenario sketched for Modern Persian above, 
the evolution of cluster-splitting epenthesis is associated not only with these structural- 
cognitive factors, but also with a specific type of language contact: external social factors 
demanding rapid and full acquisition of a language, (pre-)Hoocak, with initial #TR clus- 
ters by speakers of a language Central Algonquian language with only simple #C-onsets 
word-initially. 


? This process is distinct from creolization, since the starting point here is not a pidgin. Interestingly, many 
Creoles show initial complex onsets (Klein 2013), consistent with the view here, that they are relatively 
stable, and not particularly “marked”. 

10 An anonymous reviewer notes that if future generations have access to the donor language, and that 
language is prestigious, one may see a shift involving adoption of the donor phonotactics. 

H Though the homeland of the Siouan-Catawba language family is widely debated, oral histories and arche- 
ological remains are consistent with (pre)-Hoocak occupation of land between Green Bay and Lake Win- 
nebago (in present-day northeast Wisconsin) in pre-contact times. 

12 By the late 1650s, the Hoocak population may have been as few as 500 people, with great cultural dev- 
astation. This drastic decrease in population is attributed to a storm-related accident, epidemics (due to 
European contact), and/or battle losses to neighboring tribes (Edmunds 1978; Radin 1990). 
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4 Concluding remarks 


The typology of sound change may seem like an odd place to uncover significant evi- 
dence of cognitive forces that are independent of universal phonetics, or evidence against 
widely assumed markedness constraints.” Yet, this study of cluster-splitting epenthesis 
as regular sound change suggests that typological studies of this kind may illuminate 
our understanding of the role of human cognition in shaping sound patterns, and the 
extent to which general aspects of memory, category formation, similarity metrics, and 
analogy contribute to their evolution (Blevins & Blevins 2009). Contrary to widely as- 
sumed markedness constraints treating all complex onsets as marked or dispreferred, 
the typology of sound change suggests that word-initial #TR clusters are phonotacti- 
cally stable. On the other hand, in the rare cases where these clusters undergo regu- 
lar cluster-splitting epenthesis, this epenthesis is not a simple case of "syllable-repair". 
Rather, native-language #CV-structure in language-contact situations results in the per- 
ception of phantom vowels which take on phonological status when speakers of #CV- 
initial languages must quickly, and with little earlier familiarity, acquire a language with 
#TR clusters. This, I suggest, was the original situation of Turkic speakers acquiring Per- 
sian, and of Central Algonquians acquiring Hoocak. Unlike many other common sound 
patterns, regular cluster-splitting epenthesis does not have a simple phonetic explana- 
tion, and is not known as a purely language-internal development. By examining other 
sound changes with this profile, we may, unexpectedly, learn even more about the hu- 
man mind. 
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Chapter 2 


The domain of stress assignment: 
Word-boundedness and frequent 
collocation 


Ellen Kaisse 
University of Washington 


Phenomena that a theory of the human 
language faculty ought to accommo- 
date might well happen never to be 
attested because there is no course of 
change or borrowing by which they 
could arise. (Anderson 1992: 336) 


The linguistic literature treats hundreds of processes that apply between adjacent, open class 
content words. Overwhelmingly, these processes are local, segmental adjustments applying 
between the last segment of one word and the first segment of the next, such as voicing or 
place assimilation. Most other kinds of processes are profoundly underrepresented. Notably, 
iterative processes that eat their way across words such as vowel harmony, consonant har- 
mony, or footing (assignment of rhythmic stress) typically do not extend beyond the word 
or count material outside the word when locating a particular stressable syllable, such as 
the penult. This result becomes more understandable when one considers how processes 
are phonologized from their phonetic precursors. Precursors are articulatorily or percep- 
tually motivated and are strongest in temporally adjacent segments; they lose their force 
as the distance between segments increases, so there are no strong precursors for iteration 
outside of a word. Furthermore, frequent repetition leads to phonologization (Bybee 2006). 
Any content word in a lexicon occurs next to any other content word much less frequently 
than do roots plus their affixes, hosts plus clitics, or any combination that includes at least 
one closed class item. So we should expect roots and affixes or hosts and clitics to be much 
more common as domains for stress assignment or other iterative rules than are strings of 
independent lexical items. In this paper, I concentrate on the near non-existence of stress 
assignment rules that span a domain larger than a morphological word plus clitics. We look 
at one revealing case that does treat some pairs of content words as a single span: stress 
assignment in literary Macedonian (Lunt 1952). The spans involve frequent collocations — 
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groups of already frequent words that are frequently heard together, sometimes to the ex- 
tent that they have developed a lexicalized meaning. The other interword Macedonian cases 
involve closed class item such as prepositions and interrogatives. Finally, we consider ev- 
idence that rhythm can be evidenced statistically in the syntactic choices speakers make 
(Anttila, Adams & Speriosu 2010; Shih 2016), concluding that there may be rhythmic pres- 
sures from the lexicon to make phrases eurhythmic as well as eurhythmic pressures from 
phrases that can be phonologized to create rules of lexical stress assignment. 


1 Introduction: what kinds of processes are postlexical 
and where do stress rules fit in? 


Some types of phonological processes seem always to apply solely within a single word, 
not taking into account any material in adjacent words, while other types can apply 
between words. In work to appear (Kaisse Forthcoming) I surveyed the literature on 
lexical (word-bounded) and postlexical (non-word-bounded) rules, sampling about 70 
careful descriptions of non-tonal processes that make up their focus and environment 
from material that spans more than one content word.! Only a few involve anything 
other than strictly local adjustments between the last segment of one word and the first 
segment of the next. They might require agreement in voicing, place of articulation, or 
other features. Or they might avoid onset-less syllables by deleting or desyllabifying a 
word final vowel when the next word begins with a vowel, or by moving a word final 
consonant into the onset of the next, vowel-initial word. (1) contains some fairly familiar 
examples from Spanish. (1a) illustrates postlexical voice assimilation of /s/ to [z] before 
voiced consonants and assimilation of continuancy (/g/ — [y] and /b/ — []) after that 
/s/; (1b) shows place assimilation of a nasal to a following consonant and the retention 
of an underlying stop after non-continuant /n/; (1c) shows reduction of hiatus; and (1d) 
shows resyllabification of a word-final consonant: 


(1) Spanish (personal knowledge) 
a. Jos gato-s bjen-en/ — [loz yatoz jenen] 
the cat-PL come-3PL 
"Ihe cats are coming: 
b. /bjen-en ‘gato-s/ — ['bjenen gatos] 
come-3PL cat-PL 


‘Cats are coming: 


! Some tonal processes are well known to span large numbers of syllables, both within and between words 
(Hyman 2011), though many are also word-bounded. In Kaisse (forthcoming) I endorse David Odden's (p.c. 
2015) speculation about the long- distance behavior of tonal adjustments in Bantu languages, which offer 
the most numerous and robust examples of processes where several tones in one word can be affected by 
a distant tone in an adjacent word. Odden cites the perceptual difficulty of locating tone in Bantu, with its 
long words and subtle cues for tone, and tone's low functional load there, since only H, not L tone needs 
to be marked. 
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c. /teg-o ‘otro-s/ — [ter otros] 
have-1sc other-Pr 
‘T have others: 

d. /tjen-es ‘otro-s/ —» ['tje.ne.'so.tros] 
have-2sc other-PL 


"You have others? 


Really, any local adjustment that is found within words can be found between words. 
This exuberance of types is probably due to the fact that most phonologized processes 
start life as natural local effects and these effects are not sensitive to grammatical in- 
formation but rather to temporal adjacency (Kiparsky 1982 et seq.). However, I found 
almost no vowel harmony processes that extended into an adjacent content word, no 
consonant harmony processes (Hansson 2010), and, crucially for the current paper, no 
stress rules with a domain larger than the morphological word or the morphological 
word plus clitics. Compare the familiar types of examples in (1) with the fanciful ones in 
(2), where something resembling the English rule that constructs moraic trochees from 
the end of a word takes a whole noun phrase as its domain, resulting in feet that span 
syllables belonging to different words and in wide-scale allomorphy: 


(2) Fanciful English with noun phrase as initial domain of footing 


a. x 
(xx .) 
/taj ni dag / 
[ taj nidog] 
‘tiny dog’ 


x 
(x .Xx .) 
/taj ni krten/ 
[.tajnikiron] 
‘tiny kitten’ 
C. x 
69x x .) 
/taj ni pr 1a na/ 
[ taj nipa'1ana] 
‘tiny piranha’ 


In this paper, I will describe the continuum of stress behavior of cohering and non- 
cohering affixes (i.e. affixes that do and do not interact phonologically with their bases), 
clitics of varying degrees of stress-interaction with their hosts, compound words, and the 
one detailed presentation of a stress rule I have encountered where initial stress assign- 
ment takes some strings of content words and treats them as a single domain, ignoring 
any stress the component words might otherwise have in isolation. That case comes 
from literary Macedonian (Lunt 1952; Franks 1987; 1989). 
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Before continuing, I should make it clear that there are many rhythmic adjustments 
that do apply between content words - cases like the Rhythm Rule in English (Hayes 
1984) which is responsible for the retraction of the secondary stress in Japanese language 
(vs. Japanese) or Mississippi mud vs. Mississippi. These kinds of cases are postlexical 
and involve prosodically self-sufficient content words that have had their own stresses as- 
signed lexically, independent of the larger context in which they find themselves. There 
is then a rhythmic adjustment that demotes or moves a nonprimary stress in order to 
avoid stress clash. Like the invented example (2), the Macedonian case that I will look 
at instead involves assigning a single antepenultimate stress to a string of two content 
words which, in other syntactic contexts, would each receive a stress of their own. Often 
the single stress does not fall on any of the syllables that would have been stressed in 
isolation. This is clearly different from the way a rhythm rule works. 

Another example of postlexical stress adjustment, as opposed to the first pass of stress 
assignment, involves compound stress rules. Again, these are not relevant to my claim 
that pairs of content words are almost never the domain of the first pass of stress as- 
signment. The most well-known of these compound stress rules, like that of English, 
also simply adjust the relative primary stresses of the members, rather than treating 
them as a single unit for the lexical footing process. Thus, if we put together linguistics 
and scholar we get linguistics scholar, with the primary stress of scholar subordinated 
to that of linguistics, but we do not stress the whole string as a single prosodic word, 
which might result in antepenultimate stress falling on the last syllable of the first mem- 
ber, with the bizarre (for English) output linguistics scholar. We shall see, however, that 
occasionally languages (such as Modern Greek) do take compound words as the initial 
domain of stress assignment. So while prosodically independent content words are not 
commonly the domain for the first pass of stress assignment, some languages stretch 
that domain to include both members of a compound word. 

Yet another aspect of the postlexical adjustment of stress is offered by Newman's (1944: 
28-29) insightful early discussion of stress domains in Yokuts, pointed out to me by 
an anonymous referee. Nouns and verbs maintain their lexical stresses in connected 
speech but function words tend to lean on them and to lack stresses of their own, and 
the faster the speech, the bigger the phrasing groups and the fewer the perceived stresses. 
Newman's description is perforce rather general and impressionistic, but it summarizes 
well the plasticity of postlexical stress adjustments, which favor cliticization of function 
words and variable rhythmic groupings dependent on tempo and number of syllables.? 

While truly grammaticized rhythmic stress assignment almost never takes an initial 
domain beyond a word and its affixes and clitics, there are now known to be gradient 
rhythmic effects that extend beyond the word. They simply don't seem to be able to 
rise to the level of phonologization in that larger domain. Martin (2007), Anttila, Adams 
& Speriosu (2010), and Shih (2016) have found that vowel harmony (Martin) and opti- 
mization of rhythm can have statistical reflections in Turkish compound words and in 


? I cannot tell from Newman's description how content words that are not nouns or verbs might behave. I 
tentatively conclude that he is referring only to lack of stress on function words in rapid connected speech, 
not to complete loss of stress on open class modifiers or other content words. 
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English word order. That is, to use Martin’s felicitous phrasing, lexical generalizations 
that are part of the grammar can “leak” into larger domains, causing statistical prefer- 
ences for outputs that are consonant with the phonology of a language’s words. I would 
add that it is hard to know what the primary direction of leakage really is: the leakage 
Martin and Shih posit from smaller domains to larger ones probably co-exists with the di- 
rection of larger-to-smaller domain phonologization, which I posit in Kaisse (Forthcom- 
ing) and which is the staple view of Lexical Phonology and its descendants (Kiparsky 
1982; Bermudez-Otero 2015) and of most traditional approaches to sound change. In 
Kaisse (Forthcoming) I used the example of the phonetic precursor of vowel harmony, 
namely the vowel-to-vowel coarticulation that peters out as one gets farther away from 
the source vowel (Ohman 1966; Ohala 1992; Flemming 1997), which can be grammati- 
cized within words because stems and their affixes are in frequent collocation (Bybee 
2006)°. Classical Lexical Phonology postulates this one-way direction (from postlexical 
and variable or gradient to lexical and regular and obligatory), but there is no reason 
that once a rule is lexicalized, it cannot then generalize postlexically again (Kaisse 1993). 
One can imagine a feedback loop, where small postlexical variation in favor of alternat- 
ing stresses and avoidance of stress clash starts to be phonologized as iterative footing, 
while iterative footing starts to make syntactic phrases that are eurhythmic more desir- 
able as choices for speakers in real time. 

There is a continuum of likely domains for foot-construction. Selkirk (1995), Peper- 
kamp (1997) and Anderson (2011) demonstrate that there are various types of clitics that 
range from more to less rhythmically cohesive (interactive) with their hosts. I will extend 
the continuum, showing that in literary Macedonian (Lunt 1952; Franks 1987; 1989) the 
lexical stress rule assigning antepenultimate stress has stretched its domain to include 
even two content words, but only when they are in frequent collocation and, in some 
of the cases, have taken on a more lexicalized, semantically less compositional meaning. 
This is the “exception that proves the rule? In general, only the supremely frequent col- 
location of a closed class bound morpheme - an affix - with another affix or the root 
it can attach to provides the frequent collocation that allows for phonologization. How- 
ever, occasionally even two content words can appear together so frequently that they 
become subject to the first pass of the lexical stress rules of the language; they act like a 
single word for the purposes of building a foot at the edge of a word. Clitics and function 
words in Macedonian also form part of the initial domain for this foot-building. This ac- 
cords with the general observation that the phonology of clitics is like the phonology of 
affixes in many cases - they are ‘phrasal affixes’ (Anderson 1992). Clitics are usually less 
cohering than stem-level affixes but, as we shall illustrate in 82, there are even a few that 
act like they are inside the phonological word for the purpose of foot-construction. But 
basic foot building algorithms do not typically extend beyond the morphological word. 


? Barnes (2006) cites Inkelas et al. (2001) for evidence that phonetic vowel-to-vowel coarticulation is prob- 
lematic as a simple, unaided source for vowel harmony. Their argument comes from Turkish, where antici- 
patory phonetic effects are stronger than perseveratory ones, but the phonologized harmony system is per- 
severatory. He instead attributes the phonologization of vowel harmony to vowel-to-vowel coarticulation 
coupled with lengthening of the trigger syllable and paradigm uniformity effects allowing longer-distance 
effects on distant affixes. 


21 


Ellen Kaisse 


Occasionally they extend even less far than that, as in the case of non-cohering (stress 
neutral) suffixes of English, and occasionally they extend into the larger phonological 
word, which can include clitics and other function words that can have stressless vari- 
ants - i.e. be prosodically deficient, in Anderson e (2011)'s terms. Indeed, the failure of 
an affix to participate in the lexical footing domain is one of the main ways in which 
non-cohering affixes have been defined. Note for instance that Chomsky & Halle (1968: 
84ff) use the term “stress neutral” for the group of suffixes which, to anticipate the later 
interpretation of their intentions, are outside the prosodic word. These suffixes, which 
include all the inflectional affixes, as well as a number derivational affixes such as #ly, 
#ness, agentive #er, #ful, and many others, not only fail to affect stress but also don’t 
induce word-internal rules like Trisyllabic Shortening and Sonorant Syllabification. (See 
Kiparsky 1982 for a full discussion of Trisyllabic Laxing and Kaisse 2005 for a summary 
of the characteristics of English cohering suffixes.) Indeed, we might ask why stress 
is one of the most typical diagnostics for cohering suffixes, and not only in English. I 
would speculate that rhythm is hard to grammaticize as a phonological process with- 
out frequent repetition of the same strings. Like clitics, word-level suffixes have less 
stringent subcategorization restrictions. Fabb (1988), which we will discuss in a bit more 
detail in §2 discovered that the possible combinations of English stem-level suffixes with 
other suffixes or with roots are very restricted. On the other hand, word-level suffixes 
can combine freely with many suffixes and words, and therefore are not in as frequent 
collocation as the stem level ones, which are heard over and over again with the same 
preceding morpheme, be it an affix or a root. Clitics are even less demanding of the 
preceding host - indeed in some cases, such as special clitics, the host can belong to any 
part of speech and can even be a phrase - so they are less likely to be phonologized as 
part of the stress domain. But because they are prosodically unable to stand on their 
own, they must lean on a host and thus may sometimes be phonologized as part of the 
stress domain. 


2 Clitics and the stress domain continuum 


There has been considerable attention paid to the various ways that clitics - prosodically 
deficient items that must lean on a host in some fashion - interact with the lexical stress 
assignment rules of a language and fall into their domain. I will summarize some recent 
results here because I believe that the insights from clitics can help us understand how 
compound words and even some phrases can come to behave in the same way as clitics 
do with respect to their hosts. 

It is worth reviewing some of the typical diagnostics of clitics (paraphrased from 
Zwicky & Pullum 1983): 


(3) Characteristics of clitics 


a. clitics have a low degrees of selection with respect to their hosts; affixes have 
a higher degree of selection. 
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b. affixed words are more likely to have idiosyncratic semantics than host + 
clitic combinations 


c. affixed words are treated as a unit by the syntax while hosts plus their clitics 
are not. 


To these we can add that while the boundary between a root and stem-level (cohering) 
affix is the most favorable position for phonological interactions to occur, and the boun- 
dary between base and word-level (non-cohering) suffixes somewhat less favorable, cli- 
tics are sometimes phonologically even less connected - less cohesive - with their hosts; 
they might fail, for instance, to undergo vowel harmony, the last segment of their hosts 
might undergo word-final devoicing, as if there were no following segment, and as we 
shall see, in some languages, the clitics might not be visible to the lexical stress rules or 
at least to the first pass of those rules. 

Beginning with Anderson's (1992) classification of clitics as phrasal affixes as a foun- 
dation, Selkirk (1995) and Peperkamp (1997), elaborated in Anderson (2011), propose a 
hierarchy of clitic types. Ranked from least-cohering to most cohering with respect to 
phonological interaction with their host, we have the continuum in (4): 


(4) The clitic continuum 
prosodic word clitic > free clitic > affixal clitic > internal clitic 


a. Prosodic Word Clitic b. Free Clitic 
PPh PPh 
PWd  PWd PWd 
Host  Clitic Host  Clitic 
c. Affixal Clitic d. Internal Clitic 
PPh PPh 
PWd PWd 
PWd Host  Clitic 


Host  Clitic 


Intuitively speaking, we know that clitics usually fall somewhere in between the phono- 
logical cohesiveness of stem-level, highly cohering affixes and the phonological non- 
cohesiveness of independent prosodic words. But in this elaborated hierarchy, not all cl- 
itics do. There are clitics termed ‘internal’ by Selkirk that act, at least for some processes, 
exactly like stem-level affixes. We will extend this notion to the compound words of 
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Modern Greek and the extended stress domains of Macedonian - these involve normally 
prosodically independent words that nonetheless act as a single domain and receive one 
stress as if they were a single word. On the other end, there are clitics that Selkirk 
terms ‘prosodic word clitics;’ these are forced in some cases to behave like independent 
prosodic words despite their underlying prosodic deficiency. In (5), I illustrate the in- 
ternal type with the example of verbal clitics in Formentera Catalan (Torres-Tamarit & 
Pons-Moll 2015), where the whole verb-clitic complex is used to assign moraic trochees 
from the right. This can result in no stress falling on the verb stem itself if there are 
two monosyllabic clitics (5d) or one monosyllabic clitic followed by a clitic consisting 
of a single consonant that makes a heavy syllable (5e). The raising of the root /o/ of the 
imperative verb in (5b-5e), which only occurs to unstressed mid vowels, suggests that 
stress is not assigned cyclically to the root and then again to the host-clitic complex, but 
rather all in one go to the entire string: 


(5) Formentera Catalan (Torres-Tamarit & Pons-Moll 2015) internal clitics 
/kompr-a/ 


a. kompr-o 
buy  -2sG.IMP 


‘Buy!’ 

b. kumpr-o -l 
buy | -2sG.IMP-3sG.M.ACC 
‘Buy it/him!’ 

c. kum'pr-a =n 
buy | -2SG.IMP-PART 
‘Buy some!’ 

d. kumpr-a -mo  -lo 
buy -2sG.IMP-1SGDAT-3SG.F.ACC 
‘Buy it/her for me!’ 

e. kumpr-a -mo  -l 


buy -2sG.IMp=1sG.DAT=3M.ACC 


‘Buy it/him for me!’ 


Internal clitics meet the criteria of low selection, predictable semantics, and so forth, 
but phonologically they are unusual - they act like stem-level affixes. The situation in 
Formentera is similar to that described for Lucanian Italian clitics by Peperkamp (1997). 
Formentera and Lucanian internal clitics are wholly included within the phonological 
word and count from the outset in the calculation of where a right edge trochaic foot 
should be built. 

More generally speaking, internal clitics are prosodically deficient words that act more 
cohesively in their phonological interaction with their host than their other characteris- 
tics would suggest they should. This is captured in Selkirk’s framework by having them 
form a single prosodic word in combination with their host. Their hallmark is that they 
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are stressed and generally processed by the word-bounded on the first pass, acting just 
like a stem-level affix. This means that they can never receive a second stress all their 
own, but they may happen to be in position to take the stress of the whole host-clitic 
string, as illustrated in the Formentera form (5d) [kumpra='moa=la], where the clitic /ma/ 
receives the penultimate stress for the whole host-clitic complex. I suggest that we can 
profitably extend this insight onto bigger, more independent words than clitics. We will 
see in §3 that Modern Greek treats the otherwise independent pieces of a compound 
word as a single domain for stress assignment — a comparatively rare phenomenon. And 
Macedonian (§4) treats prepositions, even polysyllabic ones, as a single stress domain 
with their objects and treats certain common collocations of adjective + noun as a single 
stress domain as well. To put it another way, one occasionally encounters situations 
where a pair of words that should be expected to be prosodically independent can act 
like ‘internal words, parallel to the notion of internal clitics. I will link this unusual phe- 
nomenon to frequency of collocation. The more exemplars there are of a string of words, 
repeatedly heard together, the more likely they are to be internalized as a rhythmic group 
and to be reified as a domain for the assignment of a single stress. 

At the other end of the continuum of independence of clitics lie the Prosodic Word 
clitics, which receive postlexical treatment as independent prosodic words even though 
they are underlyingly prosodically deficient. They are illustrated by the prevocalic clitics 
of Bilua (Obata 2003, reported in Anderson 2011), where preconsonantal proclitics are 
normally free - they are outside the domain of stress assignment, which targets the first 


syllable of the word: 


(6) Bilua (Obata 2003, Anderson 2011: 2004) free clitic 


o= Boubac-k =a 
3sc.M=kill =3SG.FEM.OBJ=TNS 
‘He killed it? 


When a host word begins with a vowel, however, the usually stressless clitics instead 
take on a derived prosodic word-hood of their own and receive an independent stress: 


(7) Bilua (Obata 2003, Anderson 2011) PWd clitic under duress 
o:  die-k =a 
3sc.M-call =3SG.FEM.OBJ=TNS 
‘He called her. 


In this case, the clitic is forced to act like content words do normally. 

A referee has expressed some doubt about the proper analysis of Bilua clitics and, by 
extension, whether the category of prosodic clitics needs to exist at all. But it is useful to 
suspend our skepticism and contemplate such an entity because it illustrates the oppo- 
site side of a prosodic coin. The putative prosodic clitics are elements that are inherently 
prosodically deficient, but that can be forced under certain circumstances to act as in- 
dependent prosodic words with their own stress. Macedonian nouns and adjectives in 
certain frequent collocations are inherently prosodically independent elements that can 
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be forced under certain circumstances to act as prosodically deficient pieces of a single 
prosodic word, as are the words that make up a compound in Modern Greek. 

To complete the hierarchy, there are clitics whose characteristics of independence 
place them between internal and Prosodic Word clitics. The affixal clitics and the free 
clitics are probably the ones that linguists are most used to encountering. An affixal 
clitic shows some prosodic independence from its host: it is not included in the domain 
of footing on the first pass. In Selkirk’s framework it forms a recursive prosodic word 
with that host, and it therefore may receive or induce an additional stress or stress shift 
once it is included in a second pass of stress assignment. When word stress falls on the 
penult or antepenult of a content word, we find an additional stress added to a string 
of clitics if they can form a foot, as shown in the third column of (8) (Binary footing is 
indicated with parentheses.): 


(8) Neapolitan Italian (Peperkamp 1997) affixal clitics 
a. ‘konta konta-lo (konta)=(ti=lla) 
Tell mp tell.mp=it tell. imp=you=it 

b. ‘pettina pettina -lo ` (petti)na-(ti-llo) 

Comb mp comb.ımP=it comb.IMp=you=it 


Neapolitan Italian treats the morphological word, sans clitics, as the basic, initial domain 
of footing, but clitics are partially cohering, falling into a larger domain that still follows 
the basic principles of word-internal stress assignment. In Neapolitan, if the stress as- 
signed to the clitic sequence clashes with the stress of the host, the host's stress is lost: 


(9) Neapolitan Italian (Peperkamp 1997) affixal clitics with clash reduction 
fa fal  -lo fa -'tti =llə 
do mp do. mp-At do.imp=you=it 


The parallel for us here is that the pieces of compound words and larger phrases usually 
receive stress treatment individually for their component parts, but a postlexical pass 
of stress assignment can promote, move, or demote one of the stresses once the two 
elements are considered together in a larger domain. In English compounding, as we 
have mentioned, the component prosodic words each receive a lexical stress but the 
rhythm of the two is adjusted when they come together, promoting the first stress to 
primary: 


(10) English compound 
linguistics department — linguistics de partment 


And in English phrases, which typically have the most prominent stress on the final 
content word, the Rhythm Rule (Hayes 1984) adjusts the tertiary and secondary stresses 
of the first word in that phrase to avoid stress clash with the primary stress. 
(11) English phrase 
x x 
x x x x 
(x x) (x .) (x .)69) (x .) 


[Japanese] [language] — [Japanese] [language] 
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The last type of clitic we need to discuss is ‘free’? Such clitics simply do not interact 
with their host for the purposes of stress assignment. They are not themselves indepen- 
dent prosodic words but do not fall inside of any prosodic word and thus are invisible 
to every pass of footing. For instance, Barcelona Catalan verbs (Torres-Tamarit & Pons- 
Moll 2015) show the same stress whether or not they have enclitics, and the stress can 
fall three or even four syllables from the end of the word, in contravention to the gen- 
eralization that there is a three-syllable final stress window and that stress is typically 
penultimate. 


(12) Barcelona Catalan (Torres-Tamarit & Pons-Moll 2015) 
a. ‘kompr-a 
buy -2SG.IMP 


‘Buy!’ 

b. ‘kompr-a -mo =l 
buy -2SG.IMP=1SGDAT=3SG.F.ACC 
‘Buy it/her for me!’ 

c. ‘kompr-a siia.  -l 


buy -2sG.IMp=1SGDAT=3SG.M.ACC 


‘Buy it/him for me!’ 


The parallel for larger units is this: in §4.4 we will see that while some Macedonian 
prepositions are inherently stressless and are included in a stress-assignment domain 
with their objects (i.e. they are ‘internal’), others always remain stressless and do not 
form a domain with their objects (i.e. they are free). 

Even though stress is very often iterative within a word and its stem-level affixes, it 
can be unable to cross even the relatively weak boundary of a word level suffix. Consider 
for instance the lack of movement of stress from the base in English words suffixed by 
word-level suffixes such as #hood and #ly, and the concomitant location of stress outside 
the lexically mandated window of the last three syllables: 


(13) English 
a. ‘dialect#hood (c.f. ‘dialect, ‘dia lect-al) 


b. ‘manifestly (c.f. manifest, manifestation) 


If even an affix can be outside the domain of stress assignment and other phonological 
processes, it is not surprising that stress assignment often does not count clitics on the 
initial pass (or ever), let alone treat them as single domain compound words, words 
plus function items leaning on them, or the strings of prosodically independent content 
words. 

Let us tie these increasingly unlikely domains for stress assignment to frequency of 
collocation. Cohesive affixes are typically less productive and more selective with respect 


27 


Ellen Kaisse 


to the stems they combine with than are non-cohesive affixes, which are somewhat clitic- 
like. There is thus a hierarchy of selectiveness tied to frequency of collocation:* 


(14) Hierarchy of selectiveness 

e stem-level cohering affixes 

e word-level non-cohering affixes 

e clitics 

* compound words 

e words in fixed expressions 

e collocations involving closed class items, such as prepositions, even if they 
are polysyllabic 

e truly novel or unpredictable collocations that are clearly not listed in the 
lexicon 


For instance, Fabb (1988) discovered that the English stem-level adjective-forming suf- 
fix -ous has very narrow sectional restrictions. It cannot attach after any other affix at 
all. It's clear that -ous is stem level because it affects the stress of its base (moment, 
mo mentous), triggers Trisyllabic Laxing (‘6men, óminous) and meets the various other 
tests for cohesion in the literature. Similarly, the stem-level adjective-forming suffix - 
ic attaches only after the suffix -ist (capitalistic), and the stem-level adjective-forming 
suffix -ary attaches only after the suffix -ion (revolutionary). On the other hand, word- 
level adjective-forming #y can attach productively to virtually any noun (chocolate#y, or 
protein£y, recently heard on a yoghurt commercial), and it can occur in novel coinages 
after most other noun-forming suffixes (musi-cian#y, profess-or&y) Like clitic-host se- 
mantics, the meanings of words with word-level affixes are usually transparent. Finally, 
notice that ‘chocolate#y has primary stress in the same place as the base chocolate, even 
though, if ‘chocolate is pronounced with three syllables, stress in ‘chocolate#y falls in an 
otherwise impossible pre-antepenultimate position. The same is true for ‘manifestly (13b). 
Non-cohesive affixes are in a sense invisible to the lexical phonology of the language and 
can violate various of its requirements. 

Clitics are even less selective than word-level affixes. For example, many Romance, 
Greek and Slavic clitics attach to any verb and after any set of affixes. Similarly, the 
English contracted auxiliary =s can be found leaning leftward on words of almost any 
category. 


(15) English 
a. That man's a linguist; 


b. The thing you are leaning on's not safe; 


c. What you think's not important; 


4 A referee raises the excellent question of “whether this hierarchy predicts an implicational typology of 
stress domains, e.g. if a language treats compound words as a single stress domain, then clitics and all 
affixes should be parsed within the same stress domain as their host.” I do not know the answer to this 
question, though it does seem to be true for Modern Greek (discussed for instance in Anderson 2011 as 
having affixal clitics which affect the stress of their hosts) and for Macedonian. 
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d. Skin that looks pink’s an indication of good circulation. 


And both within compounds and within sentences almost any word can be followed by 
almost any other word - there are almost no selectional requirements. However, some 
words belong to closed classes - prepositions, relative and interrogative elements, and 
other function words and therefore will recur sequences more frequently. This cline lines 
up with the likelihood of two elements being taken into a single stress domain. 

The hierarchy in (14) will take us through the extended domain cases of Macedonian, 
where prepositions - even polysyllabic and semantically rich ones - are unstressable on 
their own, always forming a stress domain with their complements or being unstressable, 
and where frequent collocations of content words, particularly involving frequent words 
or collocations with unpredictable or frozen semantics, are stressed as if they were single 
words. 


3 Compounds and the stress domain continuum 


The continuum of domain size we have observed for clitics continues outward into com- 
pounds. To review, there are clitics of various sorts: the ‘internal’ type which is con- 
sidered in the first pass of stress assignment, the more familiar type which figures in a 
second pass that takes into account previously assigned stresses, and a third, free type, 
which is never counted for stress. My impression is that it is even more uncommon 
to find compound words - i.e. words made of otherwise prosodically independent ele- 
ments — forming a single domain for the first pass of footing. Rather, as we noted earlier, 
members of a compound are rather like affixal clitics, where a postlexical instantiation 
of rhythm may adjust the stresses on the basis of the newly available material but does 
not erase earlier, lexically assigned footing. But this is not always the case. 

Modern Greek demonstrates the comparatively rare type of compounding where a 
compound word is stressed as a single domain. It has been shown instrumentally by 
Athanasopoulou & Vogel (2014) that the well-known traditional description, sharpened 
in Ralli (2013), of a single, usually antepenultimate primary stress, is correct. The stress 
is placed without regard to where the stresses would fall in the individual members in 
isolation. A compounding morpheme -o is often inserted at the end of the first member 
(replacing any final inflectional ending) and stress falls on one of the last three syllables 
of the whole compound, most commonly the antepenult: 


(16) Modern Greek (Athanasopoulou & Vogel 2014; Ralli 2013) 


a. lik -os 
Wolf-M.sG.NOM 
b. 'skil-os 


dog-M.SG.NOM 
c. lik -o-  skil-o 
wolf-cMPp-dog-N.sG.NOM 


^wolfhound' 
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d. ‘riz -i 
rice-N.SG.NOM 

e. yala 
milk 

f. riz-o- yal -o 
rice-CMPD-milk-N.sG.NOM 


‘rice pudding’ 


Tokyo Japanese (Poser 1990; Kubozono 2008) has a similar phenomenon whereby a single 
pitch accent is assigned within a compound based, they argue, on foot structure. 

A referee points out that English compounds, especially those ending in the element 
-man, can sometimes have only a single stress, reminiscent of the Greek case here and 
of some Macedonian cases to come. While I have not found a published study on this 
phenomenon, Language Log (Liberman 2015) has an informative post that discusses the 
unstressed, hence reduced, final vowels in such words as fireman, clansman, gentleman 
versus the full final vowels in words like caveman, handyman, and weatherman. A lively 
set of reader responses about which words have a schwa versus the expected compound 
stress and full final vowel seems to lead to the conclusion that the longer a -man com- 
pound has been in English, the more likely it is to have a single stress. The situation here 
may be the demotion of a compound to a single word over time rather than a Greek-like 
treatment of a compound word as a single stress domain, but it falls under the general 
rubric of familiarity breeding unitary stress domains. However, the reader consensus is 
that there is no simple connection to actual contemporary frequency in the behavior of 
-man. 

I had believed that the Greek case was as far as regular stress domains ever extend. 
However, there is at least one case I am now aware of where the domain extends into 
some combinations of prosodically independent words - Literary Macedonian.? 


4 Enlarged stress domains in Macedonian 


4.1 Overview 


We have seen that while stem level affixes virtually always are taken into account in 
the first pass of foot building, many languages have stress neutral word-level affixes like 
English #ly, #ness, that are not part of the stress domain. Indeed, stress neutrality seems 
to be a recurrent, if not definitional characteristic of non-cohesiveness. This suggests that 
rhythm is not easily maintained or phonologized over large domains. Continuing along 
this cline, we saw that there are clitics of various sorts, some of which are considered 
in the first pass of stress assignment, some in a second pass that takes into account 
previously assigned stresses, and some of which are never counted for stress. Finally, we 
saw that compound words are only rarely the domain of initial footing. 


5 Tam very grateful to Ryan Bennett for bringing case to my attention. 
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A survey of stress assignment, beginning with the compendious Hayes (1995) con- 
firms the general impression that stress-assigning processes are lexical, not postlexical. 
But there is an exception, well-known among Slavicists, that in a sense proves the rule. 
Macedonian as described by Lunt (1952) and Koneski (1976) (analyzed in generative terms 
by Franks 1987; 1989 can build initial feet over certain units larger than the word. The 
example treated by Lunt and Koneski comes from what is termed ‘Literary Macedonian’ 
At first I was concerned that this might be an artificial language and that the cases of 
large domain stress could be the creation of scholars. However, Lunt (1952: 5-6) and 
Franks (1987) explain that the literary language is simply a pan-dialectal creation-by- 
commission that takes features from several western and central dialects but does not 
invent them. The large domain stress effects are found in several dialects, though some 
of the details differ from spoken dialect to spoken dialect. 

Literary Macedonian has regular antepenultimate stress: 


(17) Literary Macedonian (Lunt 1952; Franks 1987; 1989) 


a. beseda 
lecture 


b. be'seda-ta 
lecture-DEF 


c. vodenitfar 
miller 


d. vodeni'far-i -te 
miller ` -Pr-pEF 


As is usual in such systems, monosyllables are stressed and disyllables are stressed on 
the penult. (See Halle & Vergnaud 1987: 53 for a full analysis.) But some syntactically 
complex strings can be stressed as single units, termed ‘enlarged stress domains’. These 
domains are not, for the most part, unfamiliar to phonologists, as they involve poten- 
tially prosodically deficient function words such as negative or interrogative particles 
and pronouns plus the following word. However, there are also some strings of modi- 
fiers plus nouns. There are also polysyllabic propositions, which are cross-linguistically 
unlikely to be prosodically deficient (i.e., to be clitics), yet are stressed as a unit with 
their objects. I summarize Lunt's list in (18): 


(18) Literary Macedonian enlarged domains (Lunt 1952: 23-25) 

a. monosyllabic words which have no accent of their own, both proclitic and 
enclitic. The proclitics include personal pronouns, particles and prepositions. 
The enclitics include definite articles and certain indirect personal pronouns. 

b. the negative particle plus the verb, and any elements that fall between them, 
such as the present tense of the verb ‘to be; even though these normally have 
their own accent. 

c. the interrogatives meaning ‘what; ‘how, ‘where, and ‘how many; plus the 
following verb, and any stressless elements between them. 
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d. prepositions and their pronominal objects. 


e. prepositions “when used with concrete, spatial meanings" “and in a number 
of set phrases” when their object is non-definite. 


f. anumeral and the noun it modifies. 


g. some combinations of adjectives and the nouns they modify. 


Let us begin with (18g), which is the most typologically unusual. We will return to the 
also-surprising prepositions 18d) in §4.3. 


4.2 Prenominal adjectives 


It is worth quoting Lunt's remarks about the combination of adjectives and nouns in 
their entirety. 


The combination of adjective+substantive under a single accent is common to many, 
but not all, of the central dialects on which the literary language is based, and in 
any case it is not productive. Such a shift of accent is impossible if either the noun 
or adjective come from outside the narrow sphere of daily life. Therefore this usage 
is not recommended. Conversational practice is extremely varied. Place names 
tend to keep the old [i.e. single domain, antepenultimate-EMK] accent... Often used 
combinations tend to keep the single accent: ‘soured milk’ [yoghurt], ‘dry grapes- 
raisins’, ‘the left foot, ‘the lower door; ‘(he didn't see a) living soul’. Still one usually 
hears [the words stressed as separate domains]. Only with the numbers and perhaps 
a few fixed phrases [dry grapes] is the single stress widespread in the speech of 
Macedonian intellectuals. (Lunt 1952: 24-25) 


What we should note then, is that open class content words are only grouped into a 
single stress domain when the words are frequent (“narrow sphere of everyday life”), 
especially when such words are also in frequent collocation with one another (“often- 
used combinations”). And even though this system arose in some of the dialects on 
which the new literary language was based, it apparently is not easily learned. Lunt 
here reports that the single domain stress on attributive adjective plus noun has not 
been taken up by the educated speakers who adopted it. 

Franks (1987; 1989) combines Lunt’s examples with those of Koneski (1976) and others 
he and colleagues elicited. Here are several representative ones, including those men- 
tioned in the above quotation. 


(19) Literary Macedonian (Lunt 1952; Franks 1987: 989) 


a. dolnata porta 
lower gate 


b. kiselo mleko 


sour milk 


‘yoghurt’ 
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c. suvo grozje 
dry grapes 
‘raisins’ 

d. Cr'vena voda 
red water 


‘name of a village’ 


e. ‘star tfovek 
old man 


f. novata kukia 
new house 


The examples in (19) are not argued to be compound words by the various sources, prob- 
ably because they have the regular syntax of noun phrases and many such as (a), (e) 
and (f) have compositional semantics; however, they are strings that that are in frequent 
collocation and may have come to take on a specialized, less predictable meaning. Given 
the silence of the sources on the question of compound versus phrase, perhaps the most 
noncommittal and appropriate term for them is Erman and Warren’s (2000) “prefabs.” 
Prefabs are not idioms or lexicalized compounds with unpredictable meaning and pecu- 
liar syntax but simply common and conventional collocations that can be stored in the 
lexicon while having compositional meaning and normal syntax. 

The matter of whether a string of words that occur together frequently is a compound 
or a phrase is a vexed one, even for well-studied languages like English. (See Plag et al. 
2008 for an overview of the controversy.) 


4.3 Another compound or adjective-noun case 


Ryan Bennett informs me that some dialects of Modern Irish show a similar phenomenon 
in their adjective plus noun pairs. For instance, Mhac An Fhailigh & Éammon (1968) 
states for the dialect of Erris that while such a phrase typically has word stress on 
each element, frequent collocations can show a single stress. Thus the infrequent ['drax 
'xladəx] ‘bad shore’ has stress on each member, but ['fan vian] ‘old woman; has only a 
single stress. Mhac An Fhailigh calls these ‘loose’ versus ‘close’ compounds, rather than 
phrases vs. compounds, but does not give clear criteria for what defines a compound 
versus a phrase. I have not yet found an extended description of this phenomenon in 
Irish so I mention it only in passing here. 

The adjective plus noun enlarged domains of Macedonian (and perhaps of Erris Irish) 
are the exception that proves the rule - phrases are not normally the domain of stress 
assignment. If what leads to phonologization of rhythmic tendencies within words is 
frequent collocation, it is gratifying that the only extensively described example of an 
enlarged domain that I could find is one involving frequent collocation of common words, 
and it is consistent with my hypothesis that the Erris Irish seems to accord with the 
same generalization. But Lunt tells us that the Macedonian example is hard to learn 
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and is being eliminated over time. Stress rules really don’t like to apply outside a single 
content word, its affixes (or some of them) and, sometimes, its clitics. 


4.4 Macedonian prepositions and their objects 


The stress behavior of prepositions in Macedonian is also worth looking at in a bit of 
detail. Since prepositions are the heads of phrases and especially since they can be poly- 
syllabic in Macedonian, one might expect they would be independent prosodic words 
on their own. Looking at a more familiar case, English monosyllabic prepositions like 
to and for are optionally proclitic on their complements, but polysyllabic prepositions 
like behind and above are not. But Macedonian prepositions, regardless of their apparent 
prosodic heft in terms of syllable count, never receive stress as a domain on their own. 
They are, apparently, as prosodically deficient as clitics. Lunt devotes several pages (52- 
65) to the individual behavior of some two dozen prepositions because their stress be- 
haviors are somewhat idiosyncratic. The basic generalization is that prepositions have 
or receive no accent of their own. They either form a single stress domain with their 
nominal or pronominal complement, acting like internal clitics (20a); or, in some cases, 
they behave like free clitics, never receiving a stress of their own nor receiving the single 
stress of the enlarged domain (20b). 


(20) Literary Macedonian (Franks 1989) 


a. okolu selo 
near village 


‘near the village’ 


b. otkaj gradot 
from direction 


‘from the direction (of)’ 


Prosodic deficiency in prepositions makes sense from the point of view of the frequency 
of collocations. Prepositions are closed class items and they require a nominal or pronom- 
inal complement. Thus, when they are heard, they are always in collocation with a noun 
phrase. 


4.5 Summary 


As we mentioned earlier, it is helpful to think of the Macedonian collocations of full 
words that are stressed as a single word as the inverse of the Prosodic Word clitics of 
Bilua in (7), re-illustrated below in (21). PWd clitics are underlyingly prosodically de- 
ficient but can receive their own stress under duress. The Macedonian adjectives are 
elements that are not inherently prosodically deficient, even in Macedonian, yet in cer- 
tain common collocations, they are being treated as a piece of a single prosodic word, like 
internal clitics. This is illustrated in (22). From a cross-linguistic perspective, the Mace- 
donian polysyllabic prepositions are a bit unexpected in being inherently prosodically 
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deficient, parallel either to internal (20a) or free (20b) proclitics. These are illustrated in 
the second line of (22) and in (23). 
(21) Prosodic independence of underlyingly prosodically deficient clitic 

PPh 


Clitic Host 


o odie ‘he called’ 


3sc.M-call 


(22) Prosodic dependence of underlyingly prosodically independent words and of 
‘internal’ prepositions 


PPh 

ri 
kiselo ES ‘sour milk’ 
okolu selo ‘near the village’ 


(23) Macedonian unstressable preposition parallel to free proclitic 


PPh 


Preposition N 
otkaj ‘gradot ‘from the direction’ 


5 Conclusion 


Why is stress assignment almost always word-bounded? Why is it restricted only to co- 
hering affixes in some languages, and how does it manage to extend outward to clitics in 
others? The hypothesis offered in this paper and in Kaisse (forthcoming) is that rhythm, 
like long-distance harmony, is grammaticized when certain syllables are heard together 
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over and over.® This usually only happens within words, including their affixes, and, 
occasionally, with words plus slightly more independent closed class items such as cli- 
tics. Because word-level affixes have fewer selectional restrictions than stem-level ones, 
stem-level affixes are the most likely to be included in the initial domain of footing. And 
since clitics have even fewer selectional restrictions than word-level affixes, they are 
even less likely to be included. Since there are thousands of independent content words, 
the collocation of any two is much less likely to have many exemplars. However, both 
compound words (as in Modern Greek) and ‘prefabs’ - collocations of frequent words in 
frequently heard phrases like ‘old man’ - are more prone to becoming a single domain 
than just any sequence of content words. Similarly, collocations of closed class items 
like prepositions may cross the border and form a single initial stress domain. 

There may also be a functional motivation for the prosodic independence of content 
words. Extending the domain for the initial building of feet would eliminate the demarca- 
tive function linguists often ascribe to stress. Because stress is culminative, with exactly 
one primary stressed syllable per content word, it allows us to identify the independent 
lexical words of a phrase. And because stress is often predictably located on an initial, 
final, or penultimate syllable, it also allows us to identify word boundaries, helping the 
listener locate the end of one word and the beginning of the next. 

Let us return to a question I raised in $1. Does the regular, alternating stress assign- 
ment found within words stem from the grammaticization of rhythmic tendencies in 
sentences? Or are rhythmic tendencies in sentences the result of "leakage" from regular 
phonology within the word? The counterparts of word-bounded phonological processes 
are present and, in the current century, detectable, as statistical, gradient tendencies in 
larger domains such as compounds and phrases. Shih (2016) shows that syntactic choices 
are gradiently influenced by the rhythmic principles that we see operating obligatorily 
inside words as alternating stress assignment, rhythm rules, stress clash avoidance, lapse 
avoidance, and so on. She argues that word order and construction choice can be re- 
cruited in response to these phonological pressures from inside the lexical phonology, 
unearthing small effects in the choice of optional syntactic variants that favor more eu- 
rhythmic outputs. Thus she finds that the genitive constrictions X's Y and Y of X are 
skewed toward avoidance of stress clashes and lapses. Similarly, the dative alternation 
verb IO DO vs. verb DO to IO trends in a small but discoverable way towards phrases 
that are more eurhythmic (Anttila, Adams & Speriosu 2010; Shih 2016). Along the same 
lines, Hammond (2013) finds that the Brown and the Buckeye corpora contain statis- 
tically fewer instances of underlyingly clashing prenominal adjective-noun pairs than 
would be expected, so that adjectives like aloof and unknown, with final primary stress, 
are underrepresented when they are prenominal, while adjectives like happy and finite, 
with initial primary stress are not. Here we run into a chicken and egg problem. The 
more traditional view in historical linguistics is the Neogrammarian one: these pressures 


é Although tone rules are more powerful in their ability to escape the word, for reasons that, as far as I know, 
are not well understood (see footnote 2), word-boundedness is nonetheless certainly robustly attested for 
tone rules as well as other iterative processes, and I would propose that the same factors of frequent 
collocation should underlie that domain restriction. 
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are there all along as variation, with the syntax reflecting them before they are phonol- 
ogized. This is how I envisioned the progression in Kaisse (forthcoming). Along the 
same lines, Myers & Padgett (2015) note the tendency of learners to extend phenomena 
like utterance-final devoicing to the word-level. Participants in their artificial language- 
learning experiments generalized beyond the training data, applying word-final devoic- 
ing inside of utterances in novel strings, not just the kind of utterance-final devoicing 
they were trained on. Myers & Padgett (2015: 399) conclude that “learners are biased 
towards word-based distributional patterns.” They speculate that this is because we hear 
and store many more exemplars of words than we do of utterances. Baumann & Ritt 
(2012) take a similar view of the direction of development of lexical stress patterns, in- 
vestigating through game theoretic simulations the lexicalization of stress in English bi- 
syllabic words, which can have either initial (lentil, research) or final (ho tel, re searchy) 
prominence. They argue that “words adopt, on average, those stress patterns that pro- 
duce, on average, the best possible phrasal level rhythm? But Shih and Martin's views 
are equally plausible: the obligatory, regular stress system of the language spreads from 
words into gradient choices in the syntax. I suspect that both directions are operating at 
the same time, since language change results from a constant push and pull of variation 
and optimization. 

Whatever the direction of change, we have seen that it is rare for rhythmic tenden- 
cies beyond the word to become phonologized and to be reflected as foot construction 
processes that take material outside of words as their domain. I have argued that there 
are simply not enough exemplars of adjacent words heard repeatedly together to result 
in the reification of such a large domain. 
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In their seminal paper, de Chene & Anderson (1979) make a strong claim that pre-existing 
vowel length contrast is a necessary condition for the phonologization of vowel length 
through compensatory lengthening. Compensatory lengthening is thus predicted to be al- 
ways a structure-preserving change. Since that time, the claim has been challenged in nu- 
merous works (Gess 1998; Hock 1986; Morin 1992, among others). A closer examination 
of the cited counterexamples to de Chene and Anderson’s claim reveals certain generaliza- 
tions. Some apparent counterexamples, such as Samothraki Greek (Kiparsky 2011), involve 
the full vocalization stage of the consonant with the subsequent coalescence of that conso- 
nant with the preceding vowel. In other cases, such as Old French (Gess 1998) and Komi 
Izma (Hausenberg 1998), heterosyllabic or heteromorphemic identical vowel sequences are 
attested elsewhere in the language. The former cases involve the reanalysis of vowel length 
before weakened consonants that is indeed strengthened by the independent existence of 
the vowel length contrast in the languages in question, in support of de Chene and Ander- 
son’s claim. The former cases are not truly compensatory, and phonemic vowel length is 
introduced into the language through coalescence. 


1 Introduction 


In their seminal paper on compensatory lengthening (CL), de Chene & Anderson (1979) 
make a strong claim that the independent existence of a vowel length contrast is a nec- 
essary condition for the phonologization of vowel length through compensatory length- 
ening. CL is thus predicted to be always a structure-preserving change that cannot in- 
troduce contrastive vowel length into a language. Since that time, the generalization 
in its stronger version (certain sound changes are always structure preserving) or in its 
weaker version (structure preservation is a tendency in sound change) has been accepted 
and developed by linguists otherwise advocating very diverse and sometimes incompat- 
ible approaches to sound change, in particular, in research programs by Paul Kiparsky 
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visited yet again. In Claire Bowern, Laurence Horn & Raffaella Zanuttini (eds.), 
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(Kiparsky 1995; 2003) and Juliette Blevins (Blevins 2004a; 2009). However, the generaliza- 
tion has also been challenged in several works. For instance, Gess (1998) takes issue with 
de Chene and Anderson’s claim, suggesting that in general, “structure preservation is ir- 
relevant as a theoretical construct” and proceeds to argue that de Chene and Anderson’s 
interpretation of the Old French data, which is their main example, is incorrect, and that 
in Old French compensatory lengthening happened before the introduction of the other 
sources of length distinction into the language, contrary to de Chene and Anderson’s 
analysis. CL through onset loss, such as in Samothraki Greek (Topintzi 2006; Kiparsky 
2011; Katsika & Kavitskaya 2015), is also a potential counterexample to the claim that CL 
is a structure-preserving change, along with the case of Occitan (Morin 1992). In other 
languages without pre-existing vowel length contrast, such as Andalusian Spanish (Hock 
1986), Ilokano (Hayes 1989) and the Ngajan dialect of Dyirbal (Dixon 1990), vowel length 
that is the result of CL remains allophonic and predictable. In yet another type of cases, 
such as Komi IZma (Harms 1967; 1968; Hausenberg 1998), vowel length from CL appears 
to be quasi-phonemic and on its way to phonologization. 

CL is a common sound change that has occurred independently in many languages 
across the world, and only a few potential counterexamples to de Chene and Anderson's 
claim have been reported. In principle, we could have been done simply restating this ob- 
servation that supports a weaker but less controversial claim that there is a tendency for 
CL to occur in languages with pre-existing vowel length, in the spirit of proposals about 
structure-preserving sound change by either Kiparsky (2003) or Blevins (2009), but we 
will proceed to examining the most widely discussed counterexamples to de Chene and 
Anderson's claim. A closer examination of these counterexamples reveals certain gener- 
alizations. The working analyses of some cases proposed in the literature involve the full 
vocalization stage of the consonant with the subsequent coalescence with the preceding 
vowel, such as in Samothraki Greek (Sumner 1999; Kiparsky 2011). In other cases, such as 
Old French (Gess 1998) and Komi Izma (Hausenberg 1998), heterosyllabic or heteromor- 
phemic long vowels (or rather vowel sequences) are attested elsewhere in the language. 
We shall argue that the cases of CL that do not involve full vocalization (Hayes 1989; Kav- 
itskaya 2002) are in a sense truly compensatory, as opposed to instances of consonant 
vocalization and subsequent vowel coalescence. The former cases involve the reanalysis 
of vowel length before weakened consonants that is indeed strengthened by the inde- 
pendent existence of the vowel length contrast in the languages in question. In the latter 
cases, phonemic vowel length is introduced into the language through coalescence. 


2 Ihe problem 


CL through consonant loss is defined as a process whereby a vowel lengthens in com- 
pensation for the loss of a tautosyllabic consonant. CL through coda loss is the most 
typologically wide spread process, as either a diachronic change or a synchronic alter- 
nation. An example of this kind of CL in the Ižma dialect of Komi (a Uralic language 


1 T do not address CL through vowel loss in this paper. 
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of the Permian subgroup) is shown in Table 1 (Harms 1967; 1968; de Chene & Anderson 
1979). 


Table 1: CL through coda loss in Komi Izma (after Harms 1968: 104). 


Stem Pastisc Infinitive 


a. lij- lij-i lij-ni ‘shoot’ 
mun- mun-i mun-ni ‘go’ 

b. kil- kil-i ki:-ni ‘hear’ 
sulal-  sulal-i sulo:-ni ‘stand’ 


In Komi Izma, the lateral /l/ deletes in the coda position with the lengthening of the 
preceding vowel, as illustrated in (b) of Table 1.2 De Chene & Anderson (1979) propose 
that CL through consonant loss should be analyzed as an instance of the conversion of 
coda consonants, /l/ in the case of Komi Ižma, to glides (either semivocalic or laryngeal), 
/w/ in the case of Komi Izma, with the subsequent monophthongization of the result- 
ing vowel-glide sequence in the syllable nucleus, as in, for example, "kil.ni > "kiw.ni > 
ki:.ni ‘to hear’, with the intermediate stage unattested in Komi Ižma but present in other 
dialects of Komi, such as Vychegda Komi (Lytkin 1966; Lytkin & Teplyashina 1976). 

De Chene and Anderson (1979: 508) emphasize that their account is phonetic in nature 
and accounts for CL as a historical sound change, and not as a synchronic alternation: 


We will argue that these processes can be understood as the transition of the con- 
sonant, through loss or reduction of its occlusion, to an eventual glide G. It is the 
monophthongization of the resulting sequence (X)VG(Y) which gives rise to a syl- 
lable nucleus that is interpreted as distinctively long. In consequence, cases of ap- 
parent compensatory lengthening can be analysed (as far as their phonetic bases 
are concerned) as a combination of consonantal weakening in certain positions 
followed by monophthongization; and compensatory lengthening per se can be 
eliminated as an independent member of any inventory of phonetic process-types. 


This insight into the phonetics of CL serves as the basis for the analysis developed in 
Kavitskaya (2002), who maintains that CL is the result of the reanalysis of the longer 
phonetic duration of vowels as phonological length with the loss of tautosyllabic con- 
sonants. Kavitskaya (2002) maintains that vowels are more likely to be reanalyzed as 
phonologically long in the environment of more sonorous consonants after the loss of 
the said consonants, which makes the differences in vowel length unpredictable. De Ch- 
ene and Anderson's (1979) analysis of CL as a process whereby consonants weaken to 
glides supports Kavitskaya's phonetic analysis, which is shown in Table 2. 

The schematic representation in Table 2 considers two possible situations where the 
consonants X and Y are lost. Prior to the deletion of the consonants, both vowels are 


? The lengthened /a/ surfaces as [o:]. 
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Table 2: CL through coda loss (Kavitskaya 2002: 9). 


Stage 1 Stage 2 Phonologization 
(before consonant loss) (consonant loss) 


CVX C V C C V C V: 


CNN C V C C V C V 


correctly analyzed as phonologically short. In the case when the listener mishears the 
more sonorous consonant X as absent, the longer transitions are reinterpreted as a part 
of the vowel, which is subsequently reanalyzed as long. The vocalic transitions to the 
less sonorant consonant Y are shorter, and with the loss of this consonant, there is no 
reinterpretation of vowel length based on its duration. The divide between X and Y is 
arbitrary, and the more sonorous the deleting consonant is, the more likely its deletion 
to be compensated by the lengthening of the vowel. 

Several later accounts of CL are mostly phonological. The most well-known of those 
is an account by Hayes (1989), who analyzes CL through consonant loss as the deletion 
of a weight-bearing coda while preserving its weight and reassigning it to the preceding 
vowel, as illustrated in (1) for Komi Ižma. The account holds that when the underlying 
coda /l/ is deleted, its mora is left behind (in an intermediate stage) and spreads to the 
preceding vowel, making it bimoraic and thus long: 


(1) CL through coda loss in Komi Ižma (after Hayes 1989) 


o o o o o o 
l n i js n i 


k i n i k i 


The reason for the necessity of the phonetic explanation in de Chene and Anderson's 
analysis and its conspicuous absence from Hayes analysis lies in the difference between 
the general approaches to CL taken by these two accounts. De Chene and Anderson's ac- 
count carefully distinguishes between a sequence of phonetic processes that comprises 
the weakening of occlusion followed by a monophthongization of the resulting vowel- 
glide sequences and the phonological reinterpretation of some of the outputs of this 
monophthongization as long vowels. While Hayes uses historical examples to illustrate 
his points (one of the examples being Attic Greek, where CL is arguably only a historical 
process with no synchronic alternations), the account he proposes is synchronic in na- 
ture and does not consider either phonetic or phonological stages of the sound change 
analyzed by de Chene and Anderson. 


44 


3 Compensatory lengthening and structure preservation revisited yet again 


One of the important predictions of de Chene and Anderson’s account concerns the 
systemic constraints on the phonologization of vowel length. They propose that the 
phonologization of vowel lengthening as a result of CL can happen if and only if the 
language in question has a pre-existing vowel length contrast. This prediction does not 
follow directly from de Chene and Anderson’s analysis, nor it is necessary for the ac- 
counts in the spirit of Hayes. It a sense, it is not a prediction per se, but rather a gen- 
eralization about the nature of CL as a sound change. In the following sections, I will 
consider several counterexamples to this claim, discuss the similarities among these ex- 
amples, and offer some speculation on why de Chene and Anderson’s generalization is 
at least a tendency in the languages of the world. 


3 CL with no pre-existing vowel length: Apparent 
counterexamples 


As can be inferred from an (admittedly small) survey of languages with CL in Kavitskaya 
(2002), CL is more often a structure-preserving sound change. In the majority of the 
cases of CL, this tendency indeed holds: out of 80 languages with historical CL sound 
changes listed in Kavitskaya’s (2002) survey, 72 or 90% occur in languages with pre- 
existing long/short vowel contrasts, while only 8 or 10% are found in languages without 
a pre-existing vowel length contrast? These 8 languages constitute counterexamples 
to the stronger version of the claim, which holds that CL as a sound change is always 
structure-preserving. However, first, the presence of counterexamples does not make 
the tendency false (it is just not a universal). Second, there seems to be an important 
difference between the cases that are structure-preserving and the cases in which vowel 
length (mostly allophonic) is potentially introduced into a language through CL. 


3.1 Old French 


Old French is the central example used by de Chene and Anderson to illustrate that CL 
as a sound change does not happen unless contrastive vowel length is independently 
present in the language. According to de Chene & Anderson (1979: 527-528), stated 
after Pope (1934: 79, 191), the diphthong [aw], inherited both from Indo-European and 
from Vulgar Latin, monophthongized to a short [o] in French by the middle of the 9th 
century, as in (2a). The loss of other consonants, such as velars before l and n and p/b 
and t/d before p/b, t/d, and s, that took place at approximately the same time, was not 
accompanied by CL either, as exemplified in (2b): 


3 This information is not explicitly present in Kavitskaya (2002) and was compiled by Blevins (2009). 
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(2) a. Monophthongization of [aw] to [o] in Old French (circa 850 AD) 
or < aurum ‘gold’ 
oser < ausare ‘to dare’ 
forge ‘forge’ < *faurga < fabrica ‘workshop’ 
parole < paraula < parab(o)la ‘word’ 
b. Loss of consonants g, k, p, and d in Old French (before 850 AD) 
agneau [ano] < agnellum ‘lamb’ 
maille [may] < mac(u)lam ‘stain’ 
route ‘road’ < (via) rupta ‘broken road’ 
aprés ‘after’ < adpressum ‘near’ 


Another wave of monophthongization, presumably through the weakening of the 
coda [l] to a labiovelar glide, happened in Old French by the 16th century, this time 
resulting in a long [o:], as illustrated in (3a). The loss of other pre-consonantal conso- 
nants, such as nasals (complete by the middle of the 16th century) and fricatives s and z 
(earlier), was accompanied by CL. Examples of the loss of fricatives are in (3b), and CL 
through the loss of nasals is illustrated in (3c). 

The examples in (3b) show the orthographic s that was preserved in such words until 
1740 (de Chene & Anderson 1979: 520). However, Pope (1934: 151) mentions that 12th 
century poetry suggests that the fricative had begun to drop before voiced consonants 
by this period. French loanwords in English, such as blame, male, and isle, do not have a 
pronounced [s], which adds more support to this conclusion: 


(3) a. Monophthongization of [aw] < [al] to [o:] in Old French (16th century) 
autre [o:tr] « alterum ‘other’ 
aube [o:b] ‘dawn’ « alba ‘white’ (fem.sg.) 
b. Loss of fricatives with CL in Old French (12th century) 
méler (ModFrench) « mesler (Old French) < misculare ‘to mix’ 
ile (ModFrench) < isle (Old French) < insula ‘island’ 


c. Loss of nasals with CL in Old French (16th century) 
fendre [fa:dr] < findere ‘to split’ 


rompre [r3:pr] < rumpere ‘to break’ 


De Chene and Anderson claim that the difference between the outcomes of the two 
Old French monophthongizations, as well as between the loss of consonants without 
and with vowel lengthening, lies in the fact that in the 9th century, the Old French 
vowel system did not exhibit contrastive vowel length, and so vowels did not lengthen 
as a response to the loss of consonants, while by some time in the 12th to 16th century 
a vowel length distinction was introduced into the system independently, and this pre- 
existing vowel length contrast made it possible for the reanalysis of the vowels that 
preceded the lost consonants as long. 

It is argued in de Chene (1985) that languages typically acquire vowel length through 
vowel coalescence (dubbed as vowel hiatus or geminate vowel clusters, de Chene & 
Anderson 1979: 520). De Chene and Anderson (1979) state that French obeys this rule 
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and acquires long vowels through the deletion of intervocalic consonants and subse- 
quent vowel coalescence in the period between the changes exemplified in (2) and (3). 
The examples of consonant loss and vowel coalescence are presented in Table 3. 


Table 3: Intervocalic consonant loss between identical vowels (de Chene & 
Anderson 1979 after Pope 1934). 


Modern French Old French Latin 


bâiller baailler bataculare ‘to yawn’ 

graal gradalem ‘dish’ 

aates adaptas ‘suitable’ (fem.acc.pl) 
sceau seel sigillum ‘seal’ 


Gess (1998) takes issue with de Chene and Anderson’s claim that CL is only possible 
in languages with a preexisting vowel length contrast. He argues with the claim on the 
basis of the evidence from Old French. The objection is that the putative long vowels are 
treated as disyllabic in 12th and 13th century poetry in Old French, as shown in (4) for one 
of the examples in Table 3. From the scansion of the octosyllabic line in (4), it is evident 
that graal ‘dish’ consists of two syllables for the purposes of poetic syllabification: 


(4) Le Roman de Perceval, late 12th century (Roach 1959: 3, 11, 76-77; Gess 1998: 357) 
Ce est le contes de GRAAL, 
123456 78 
‘This is the story of the Grail; 
Dont li quens li bailla le livre. 
"About which the count gave him the book 


The evidence from the scansion provided by Gess (1998) is questionable since syllab- 
ification in poetry is often conservative and reflects an earlier stage of the language. 
The scansion is also consistent with the possibility that vowel coalescence has already 
happened and long vowels scan as two syllables, with a poetic line becoming mora- 
counting rather than syllable-counting (see discussion in de Chene (1985: 76, 84ff) about 
such developments in Japanese and Tongan). If this is the case, then Old French does 
not constitute a counterexample to de Chene and Anderson's generalization. 

The metrical evidence presented by Gess (1998) is thus inconclusive. However, even 
if Gess' interpretation is correct and indeed his examples illustrate that at the time of 
CL in Old French there were heterosyllabic sequences of identical vowels, it could have 
been sufficient to strengthen the possibility of CL, as we shall further discuss for another 
example in the next section. 


^T am grateful to a reviewer for the discussion of this point. 
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3.2 Komi Izma 


It would be informative now to return to Komi Ižma, which does not have contrastive 
long vowels in the inventory, or any other allophonically long vowels, except for those 
that are derived by CL (Lytkin 1966; Lytkin & Teplyashina 1976; Hausenberg 1998). Thus, 
in principle, Komi Izma constitutes a counterexample to de Chene and Anderson’s claim 
interpreted broadly, as noticed by Gess (1998). The forms in Table 4 illustrate CL alter- 
nations in Komi Ižma: 


Table 4: CL through coda loss in Komi Ižma (after Harms 1968: 104-105). 


Stem Past Ise Infinitive 
kil- kili kini ‘to hear’ 
sulal- sulali sulo:ni ‘to stand’ 


Indefinite Definite Dative 


pi pijis pili ‘son’ 
pi pilis pili ‘cloud’ 
v9: volis voli ‘horse’ 


The deletion of l in Komi Ižma went through the stage of the loss of the occlusion 
of the liquid to the labiovelar glide w, followed by the monophthongization of the Vw 
sequence? The diphthongal stage is synchronically attested in related dialects of Komi, 
spoken in Vychegda and Syktyvkar, and there is also a dialect group in Komi that pre- 
serves the lateral (cf. va /vəl/ ‘horse’ in Komi Ižma vs. vəv /vəl/ ‘horse’ in Vychegda 
Komi vs. vol /vol/ ‘horse’ in Komi Yazva) (Lytkin 1966: 44-49, Lytkin & Teplyashina 1976: 
106-115).° 

De Chene and Anderson (1979) maintain that Komi Izma data do not counterexem- 
plify their generalization since the language has heteromorphemic long vowels (or vowel 
sequences), so sufficient contrast in vowel duration is present for CL to go through. 
Hausenberg (1998: 309) states that in dialects long vowels may develop through assimi- 
lation in forms like una-an < una-én ‘many’, baba-as < baba-is ‘his wife’. 

In a narrower sense, Komi Ižma is not a counterexample since CL does not introduce 
vowel length contrast into the language: vowel length is allophonic and predictable, and 
even though ‘son’ and ‘cloud’ look like a minimal pair, they are underlyingly /pi/ ‘son’ 
vs. /pil/ ‘cloud’. Abondolo (1998: 13) calls vowel quantity in Komi Izma “nascent”, thus 
interpreting vowel length distinction in the language as quasi-phonemic and possibly 
on its way to phonemicization. However, another view on the facts of Komi Izma CL is 
possible that provides additional evidence in support of de Chene and Anderson’s claim.’ 


5 Syllable-final /l/ frequently undergoes vocalization; cf., for instance, l-vocalization in BCS (South Slavic): 
beo /bel/ ‘white-masc’ (vs. bela ‘white-FEm’), video /videl/ ‘see-past.Masc’ (vs. videla 'see-PAST-FEM ). 

® Yet another dialect of Komi, Komi Inva, vocalizes /l/ into [w] in all positions (Lytkin 1966: 44-49). 

7 I am much indebted to a reviewer for the following discussion of vowel length and morphology. 
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These long vowels can arise through either inflection, as in Table 4, or derivation, as in 


(5): 


(5) CL in Komi Ižma (Collinder 1957: 309 via de Chene & Anderson 1979: 525) 
perna-al-as 
CYOSS-VERB-3SG.PRES 


‘he hangs (TRANS), as a cross on one’s breast’ 


In an important paper that defines the place of morphology in grammar, Anderson 
(1982) proposes that the traditional category of inflection is the subset of morphology 
that is relevant to the syntax. As a consequence, inflection depends on the results of 
syntactic operations and is post-syntactic, while derivation happens before syntax. Thus, 
according to Anderson’s model, the units of lexical storage are stems that “include all 
internal structure of a derivational sort” (Anderson 1982: 592). Endorsing this approach 
amounts to saying that, since long vowels resulting from the addition of derivational 
material are robustly attested in Komi Izma, the language has lexical long vowels even 
if none of them are morpheme-internal. 


3.3 Samothraki Greek 


One of the languages in which CL introduces phonemic vowel length into a system 
without pre-existing vowel length contrast is a dialect of Greek spoken on the island of 
Samothraki (Newton 1972a,b; Hayes 1989; Katsanis 1996; Sumner 1999; Kavitskaya 2002; 
Topintzi 2006; Kiparsky 2011; Katsika & Kavitskaya 2015). Samothraki Greek is not a 
usual case of CL in yet another respect since it is the loss of the onset, not the coda, that 
triggers tautosyllabic vowel lengthening, as illustrated in Table 5. 

In Samothraki Greek, the prevocalic r deletes with the lengthening of the following 
vowel in a) the word-initial onset of either stressed or unstressed syllable, as in Table 5a, 
and b) after a consonant in a complex onset, both in biconsonantal clusters, as in Table 5b, 
and triconsonantal clusters, as in Table 5c, both in stressed and unstressed word-initial 
and word-medial/final syllables: 


Table 5: CL through onset loss in Samothraki Greek (after Katsika & Kavitskaya 


2015).8 
Standard Greek Samothraki Greek 
a. 'ri.ze UE ‘root’ 
ce. vi.Oce i:.'vi.Oce ‘chickpeas’ 
r2. Óv.ci.ne u:. Oe cn ‘peaches’ 
b.  vrisi vis ‘faucet’ 
Ori.mi ‘Bim ‘shard’ 
C.  "*.spros 'g.spu:s ‘white’ 
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The examples in Table 6 show the synchronic status of r-deletion in Samothraki Greek: 
the rhotic surfaces in the coda and as a first consonant in a complex onset, but deletes 
intervocalically. On the basis of such alternations, Kiparsky (2011) argues that the pres- 
ence of r-zero alternations constitutes evidence for the synchronic status of CL in the 
language: 


Table 6: Alternations in Samothraki Greek (Katsika & Kavitskaya 2015: 7). 


Cer ‘hand’ po. der ‘foot’ 
'ce.rje ‘hands’ po. óv.rje ‘feet’ 
ciuóje ‘little hands’ po.de.u.dje “little feet’ 


However, as Katsika & Kavitskaya (2015) point out, there are no synchronic alterna- 
tions where the deletion of /r/ is accompanied by vowel lengthening. In other words, 
there are no attested examples in which one member of a semantically related pair has a 
surface [r], while the other exhibits a long vowel as a consequence of the r-deletion. On 
the basis of this, Katsika & Kavitskaya (2015) conclude that it would be more accurate 
to analyze r-zero alternation as a synchronic process, and CL through the loss of r as a 
sound change in Samothraki Greek. 

CL through onset loss presents a problem for the theories that treat CL as weight con- 
servation (such as Hayes 1989), which predict that only the deletion of coda consonants 
can result in vowel lengthening. It is generally assumed that, unlike codas, onsets cannot 
bear weight and do not count as moraic.? Several such problematic cases, including CL 
through onset loss in Samothraki Greek, are reanalyzed in Hayes (1989). Hayes extends 
Newton's (1972) idea that rC clusters underwent vowel epenthesis of the form VrC — 
VriC — ViC and proposes that identical vowel epenthesis happened in Cr clusters as 
well, yielding CrV; — CVijrV; — CV;:. The deletion of the intervocalic r could then be 
followed by vowel coalescence, just like in other VrV — V: cases in Samothraki Greek. 

However, as shown by Topintzi (2006), the Samothraki Greek CL resists such a re- 
analysis since the deletion of the word-initial r cannot be accounted for by metathesis. 
In addition, Kiparsky (2011) claims that Hayes' analysis is problematic because it incor- 
rectly predicts the merger of the outputs of the r-deletion from CrV and VrV. While 
after the loss of r, the original "rV sequence where the vowel is accented becomes a long 
vowel accented on the first mora, as in 0rimi — Oiim ‘shard’, the original “VrV sequence 
where the second vowel is accented becomes a long vowel accented on the second mora, 
as in xara — xaá ‘joy’. However, Heisenberg (1934: 91) notes that if r-deletion results 
in a sequence of identical vowels with the stress on the second vowel, the stress shifts 
from the second vowel to the first one, as in /karávi/ — [kaav] ‘ship’. Newton (1972a: 79) 


8 In Samothraki Greek, unstressed high vowels /i/ and /u/ delete unless the deletion creates phonotactically 
unacceptable structures. Unstressed mid vowels Zei and /9/ raise to [i] and [u] (Newton 1972a: 79). 

? Ryan (2014) presents statistical evidence from stress and meter showing that onsets are factors in sylla- 
ble weight, though they are subordinate to the rhyme with respect to weight. For the discussion of the 
possibility of moraic onsets, see Curtis (2003), Davis (1999), among others. 
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interprets the stress shift as evidence for vowel contraction (coalescence), while Heisen- 
berg (1934: 90) and Margariti-Rogka & Tsolaki (2011) ascertain that the vowels remain 
separate and belong to different syllables in such cases. 

While the moraic weight approach does not seem to account for the Samothraki Greek 
CL, Kavitskaya (2002) proposes a phonetic/historical account. According to Kavitskaya 
(2002: 99), r is vocalic enough to be reinterpreted as additional vowel length. Kiparsky 
(2011) argues that neither purely phonetic models nor purely phonological (weight con- 
servation) models are sufficient to account for CL in Samothraki Greek. He develops 
an account that relies on the observation that r is excluded from the onset position 
cross-linguistically (Zec 2007). Typologically, high sonority segments are dispreferred 
in the onset, which is evident from the fact that many languages, such as Korean, vari- 
ous Turkic languages, Basque, Piro, Telefol, etc., do not allow rhotics in word-initial or 
syllable-initial positions (de Lacy 2001; Smith 2003) even though they have some type of 
r in their consonant inventories. Languages employ different strategies to avoid onset 
rhotics, such as prothesis, deletion, fortition, anti-gemination, and incorporation into 
the nucleus (Kiparsky 2011: 26). Specifically, in Samothraki Greek the prohibition on 
the rhotic in the onset is resolved through the latter strategy: the rhotic is syllabified 
as a part of the nucleus so that the r and the following vowel form a rising diphthong, 
and then deletes with CL. Katsika & Kavitskaya (2015) develop an articulatory phonetic 
account of Samothraki Greek CL that builds both on Kavitskaya (2002) and Kiparsky 
(2011). To resolve the dispreference for the onset rhotic, the tongue tip constriction of 
the r is deleted, but the tongue body constriction is kept, preserving some of the seg- 
mental and temporal information of the r. The resulting segment is highly vocalic and 
is subsequently incorporated into the nucleus. Thus, Katsika and Kavitskaya’s (2015) 
account provides articulatory motivation to Kiparsky’s idea that in Samothraki Greek, 
the onset r goes through a vocalic stage followed by the coalescence with the following 
vowel. 

We can thus conclude that the best analysis of Samothraki Greek CL treats it as a 
two-stage process, under which the vocalization of the onset r happens first, followed 
by the coalescence of the two vocalic elements.” 


3.4 Towards the explanation of CL as a sound change 


From the point of view of contrast maintenance and loss, CL can be described as the 
loss of contrast in a certain position. In the case of CL through the loss of consonants, 
it is usually the coda consonant that deletes with the lengthening of the tautosyllabic 


10 A reviewer points out that there are cases when the deletion of a coda consonant happens simultaneously 
with intervocalic deletion of the same consonant, as, for example, in Turkish (de Chene & Anderson 1979, 
Kavitskaya 2002: 23). The reviewer suggests that this renders such examples consistent with de Chene 
and Anderson’s generalization. If the same was the case in Samothraki Greek, and the coalescence was 
phonetically complete at the time of r-deletion, this, by itself, would be enough to exclude Samothraki 
Greek from potential counterexamples to de Chene and Anderson’s generalization. I believe, however, 
that CL through onset loss in Samothraki Greek is best re-analyzed as vowel coalescence, in the spirit of 
Kiparsky (2011). 
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vowel. In a system with no phonologically long vowels, the result of this process could 
in principle be the introduction of a new vowel length contrast (the phonologization of 
vowel length in a narrow sense). However, in the case of the pre-existing vowel length 
distinction, the result is the introduction of the merger of the new long vowels with 
existing long vowels (the phonologization of vowel length in a certain position, in a 
broader sense of phonologization). 

On the basis of the examples discussed above as well as other instances of CL, it an 
be argued that CL as a sound change should indeed be defined as the lengthening of 
the vowel after the loss of the tautosyllabic consonant as a result of the phonological 
reanalysis of the additional vowel length, either in the spirit of de Chene & Anderson 
(1979) or of Kavitskaya (2002). De Chene & Anderson (1979) dub this process “monoph- 
thongization", that is, roughly, a vowel shift under which a monosyllabic vowel of two 
vowel qualities becomes a monosyllabic vowel of one vowel quality (as exemplified by 
Old French and Komi Izma, among many others). From our admittedly incomplete sur- 
vey of CL, some kind of a pre-existing length contrast is a necessary condition for CL 
in the cases where such reanalysis is involved, and no clear cases that counterexemplify 
this prediction have yet been found if this contrast is interpreted as including heterosyl- 
labic and heteromorphemic sequences of identical vowels. Thus, CL is best described as 
phonologization in a broader sense, that is, a merger of existing long vowels with new 
long vowels that are the result of CL. It is possible that Samothraki Greek could also be 
reanalyzed along these lines (as discussed in footnote 10), but, according to Kiparsky's 
(2011) account and the phonetic evidence amounted in Katsika & Kavitskaya (2015), it 
stands out since the sound change goes through an intermediate stage, whereby the con- 
sonant becomes a full vocalic entity. CL in this case is a misnomer, and Samothraki Greek 
is really an instance of vowel coalescence, which is a well-known and uncontroversial 
source of vowel length in the languages of the world. 

We can thus conclude that Samothraki Greek is not a counterexample to the gener- 
alization because the lost consonant vocalizes completely and then vowel coalescence 
happens, with the result that is reminiscent of CL as it has the initial stage of consonant 
plus vowel and a final stage of a long vowel, but is not, in fact, CL, but rather an instance 
of VV » V: coalescence. In turn, Old French is not a counterexample because, even if 
Gess's analysis of the Old French on the basis of the metrical scansion is correct, the 
presence of a sequence of identical heterosyllabic vowels, which are likely to be phonet- 
ically identical to long vowels, provides sufficient contrast. Finally, Komi Izma is not a 
counterexample to the most restricted version of the generalization because the presence 
of a sequence of identical heteromorphemic vowels is sufficient contrast. 


4 Sound change, mergers, splits, and contrast 


A broad question that remains to be discussed is the reason for why the CL sound change 
that proceeds by gliding followed by monophthongization (de Chene & Anderson 1979) 
tends to be structure-preserving, that is, is more likely to acquire vowel length in certain 
positions with the loss of the consonant if vowel length is already contrastive elsewhere 
in the system? 
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Two distinct proposals in the literature address the question of the relevance of struc- 
ture preservation to sound change. One view on the structure-dependence of sound 
change is expressed in Kiparsky (1995; 2003) and is to various extent present in other 
work by Paul Kiparsky. Another view on structure-preserving sound change is pre- 
sented in Blevins (2004a) and developed in Blevins (2009). As Anderson (2016) notes, 
Blevins and Kiparsky advocate quite different views on the explanation of sound change. 
While Blevins (2004a; 2006) puts the main burden of explanation of the sound change 
on the phonetic factors, Kiparsky (2006) in a critique of Blevins’ program views indi- 
vidual grammars as a result of both “what change can produce and of what the theory 
of grammar allows” (Anderson 2016: 17). Interestingly, both Blevins and Kiparsky see a 
place for structure preservation in the theory of sound change, either as belonging to the 
grammar (Kiparsky 1995; 2003) or emerging through acquisition (Blevins 2004a; 2009). 

Kiparsky (2003: 328) comments on “the textbook story” of phonologization, where re- 
dundant features become phonemic with the loss of conditioning environment (e.g., in 
the CL sound change, vowel length phonologizes with the loss of the tautosyllabic con- 
sonant). However, as Kiparsky (2003) points out, in many similar cases the redundant 
features fail to phonologize and disappear with the loss of the conditioning environ- 
ment. Kiparsky goes on to posit a priming effect, which is a diachronic manifestation of 
structure preservation, formulated as in (6): 


(6) Priming effect in phonologization (Kiparsky 2003: 328) 
Redundant features are likely to be phonologized if the language’s phonological 
representations have a class node to host them. 


Kiparsky (2003) distinguishes between the two types of sound change, perception- 
based and articulation-based, and claims that while perception-based changes, such as 
CL, metathesis, tonogenesis, and assimilation, are more likely to be structure-preserving 
(phonologization in a broad sense, as defined in §3.4), articulation-based changes, such 
as lenition, umlaut, etc., are usually structure-changing (phonologization in a narrow 
sense, as defined in §3.4). Among the structure-changing processes that create long 
vowels are vowel coalescence and also vowel lengthening in specific prosodic conditions 
(for instance, under stress). Kiparsky (2003: 329) notes that Korhonen (1969: 333-335) 
suggested that only certain allophones have a functional load that allows for the phone- 
micization with the loss of conditioning environment. Korhonen (1969) dubs these allo- 
phones quasi-phonemes. Having claimed that the classical phoneme is “something of a 
straightjacket” when it comes to understanding of the introduction and loss of phono- 
logical contrast, Kiparsky (2013) proposes a system, where he distinguishes between 
contrastiveness, as a structural notion, and distinctiveness, as a perceptual notion, as 
shown in Table 7. 

By the system in Table 7, quasi-phonemes are not contrastive, but distinctive, and 
thus they represent a necessary stage to the secondary split. Since distinctiveness is a 
perceptually-defined notion, only those sound changes that are perceptually-based are 
predicted to follow this pattern. As was discussed in §3.3, vowel length in Komi Ižma is 
quasi-phonemic and thus is a likely candidate for the phonologization of vowel length 
in the language. 
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Table 7: Contrastiveness vs. distinctiveness (Kiparsky 2013). 


Contrastive Non-contrastive 


Distinctive Phonemes Quasi-phonemes 
Non-distinctive Near contrasts Allophones 


Blevins (2004a; 2009) pursues a research agenda that is very different from Kiparsky’s 
theory of sound change. However, she also notes that certain sound changes tend to 
be structure-preserving, and that these changes tend to be perceptually-based. Blevins 
(2004a) posits a principle of structural analogy, stated in (7): 


(7) Structural Analogy (Blevins 2004a: 154) 
In the course of language acquisition, the existence of a (non-ambiguous) 
phonological contrast between A and B will result in more instances of sound 
change involving shifts of ambiguous elements to A or B than if no contrast 
between A and B existed. 


The consequence of such principle for sound change is a tendency towards structure 
preservation. Blevins (2009) presents an overview of two known cases of sound changes 
that have this tendency, such as CL (de Chene & Anderson 1979; Kavitskaya 2002) and 
metathesis (Blevins & Garrett 1998; Blevins 2004a,b; Hume 2004) and then proceeds to 
a case study of the Principle of Structural Analogy, unstressed vowel syncope in Aus- 
tronesian. 

According to Blevins (2009), unstressed vowel syncope in the Austronesian languages 
discussed is a perceptually-based sound change that is the result of the ambiguous vocalic 
status of hypoarticulated short unstressed vowel. The loss of the second vowel in a 
CV.CV.CV sequence creates a structure CVC.CV where the first syllable is closed. The 
Principle of Structural Analogy predicts that languages that contrast open and closed 
syllables will have a stronger tendency towards this kind of syncope. Indeed, as Blevins 
(2009) shows, the prediction is borne out. 

Blevins’ (2009) example is different from the case of CL in an interesting and funda- 
mental way. While CL as a sound change amounts to the introduction of a new allophone 
and potentially a new phoneme with a positional loss of a segment, unstressed vowel 
syncope is the introduction of a new prosodic structure with a positional loss of a seg- 
ment." This example provides additional support to the generalization that the presence 
of contrast in the system affects sound change that potentially creates similar structures. 


1 Asa reviewer notes, length is prosodic structure as well, and in this sense, there is little difference between 
the case discussed by Blevins and the cases of CL. However, while CL (potentially) introduces a new ele- 
ment to the inventory of phonemes, Blevins discusses an example that introduces a new structure to the 
inventory of syllables. 
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5 Conclusions 


De Chene and Anderson (1979) had an important insight about the structure-preserving 
nature of CL that holds in the majority of the languages with this sound change and 
thus cannot be ignored. I have presented examples in which systemic considerations 
play an important role in the phonologization of newly introduced phonetic detail in 
perception-based sound changes, such as vowel duration in CL. I have shown a way 
to address potential counterexamples to the generalization, reanalyzing CL in Samoth- 
raki Greek as vowel coalescence and arguing that in the cases of Old French and Komi 
Izma the presence of identical tautosyllabic vowels elsewhere in the system might have 
constituted a sufficient contrast for the phonologization of vowel length through CL. 
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Phonological exceptionality is localized 
to phonological elements: the argument 
from learnability and Yidiny word-final 


deletion 
Erich R. Round 


University of Queensland 


Anderson (2008) emphasizes that the space of possible grammars must be constrained by 
limits not only on what is cognitively representable, but on what is learnable. Focusing on 
word final deletion in Yidiny (Dixon 1977a), I show that the learning of exceptional phono- 
logical patterns is improved if we assume that Prince & Tesar’s (2004) Biased Constraint 
Demotion (BCD) with Constraint Cloning (Pater 2009) is subject to a Morphological Coher- 
ence Principle (MCP), which operationalizes morphological analytic bias (Moreton 2008) 
during phonological learning. The existence of the MCP allows the initial state of Con to be 
simplified, and thus shifts explanatory weight away from the representation of the grammar 
per se, and towards the learning device. 


I then argue that the theory of exceptionality must be phonological and diacritic. Specifically, 
I show that co-indexation between lexical forms and lexically indexed constraints must be 
via indices not on morphs but on individual phonological elements. Relative to indices on 
phonological elements, indices on morphs add computational cost for no benefit during 
constraint evaluation and learning; and a theory without indices on phonological elements 
is empirically insufficient. On the other hand, approaches which represent exceptionality 
by purely phonological means (e.g. Zoll 1996) are ill-suited to efficient learning. Concerns 
that a phonologically-indexed analysis would overgenerate (Gouskova 2012) are unfounded 
under realistic assumptions about the learner. 


1 Exceptionality 


What is the nature of representations which are passed from the morphology to the 
phonology? Anderson (1992) demonstrates that the processes that create those represen- 
tations can be elaborate and complex. Operations that act upon morphological forms, to 
realize units of morphologically-relevant meaning, involve not only the concatenation 
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of formatives, but also selection among alternatives and non-concatenative modifica- 
tions to intermediate representations (see also Anderson 2015; 2016; 2017). However, 
what of the final result, which comprises some number of morphs that must then be 
interpreted phonologically? A constant concern of generative phonology since its in- 
ception has been to account adequately for patterned phonological exceptionality, the 
phenomenon in which segments in a restricted class of morphs exhibit phonologically 
distinctive behavior as triggers, targets or blockers of alternations, or as participants in 
exceptional featural, phonotactic or prosodic surface structures. For example, in Yidiny 
(Dixon 1977a,b) vowels delete word-finally, if that deletion would prevent the word from 
surfacing with an unfooted syllable. This is seen in the root gayara- ‘possum’ in (1a) and 
the suffix -na ACCUSATIVE in (1b), where feet are marked by parentheses. However, in a 
restricted set of morphs the final vowel behaves exceptionally, resisting deletion, as in 
the root guzara- ‘broom’ (2a) and the suffix -na PURPOSIVE (2b). 

(1) a. ‘possum.aBs’ /gajara/ — (ga ja:r) 
b. ‘father-acc’ — /bimbi-na/ — (bimbi) 


(2 a. ‘broom.ass’ /guyara/ — (guga:)ra 
b. 'go-PURP' /gali-na/ — (gali:)na 


In order for the phonology to treat morph-specific, exceptional segments appropriately, 
it must receive from the morphology some kind of discriminating information which it 
can act upon. For much of the generative period it has been argued that this informa- 
tion is associated with morphs as a whole, and not with their individual phonological 
elements. Here I present an argument for the contrary view. The contribution, then, is to 
clarify the nature of one important aspect of the interaction between the morphological 
and phonological components of grammar. The principle line of evidence is learnability, 
namely the learnability of an optimality-theoretic grammar for phonological exception- 
ality. Anderson (2008) has emphasized that the space of possible human grammars must 
be constrained not only by limits on what is cognitively representable, but also on what 
is learnable. The crux of the argument here relies not on specifics, but ultimately on gen- 
eral properties of learnable grammars, and thus I would hope should remain valid even 
as specific theories undergo refinement as they move closer to answering Anderson’s 
(2008) challenge. 

The chapter falls into two broad parts. In §2-§5 I discuss the processes and principles 
required to learn exceptionality. This leads to the positing of a Morphological Coherence 


1 A reviewer asks whether the machinery presented here is necessary if one assumes an exemplar-based 
model of phonology. I assume that learners do store rich, exemplar-like representations of linguistic ex- 
periences. However, natural language morphology in general has enough combinatorial complexity that 
reliance upon retrieved episodes will not be sufficient to reproduce the full range of creative behavior that 
humans display. Consequently some generative machinery is necessary, which performs not merely sim- 
ple analogies and concatenations, but which can reproduce with precision the complex patterns generated 
by a realizational morphology such as Anderson's (1992), and by a formal phonological grammar such as 
entertained here. 
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Principle in §6, which operationalizes a morphological bias that ensures successful learn- 
ing for certain cases. In 87-89 I am concerned with the underlying theory of these pro- 
cesses and principles. I evaluate two broad approaches to phonological exceptionality: 
PHONOLOGICAL approaches, which represent exceptionality as a property of individual 
segments (Bloomfield 1939; Kiparsky 1982a; Inkelas 1994; Zoll 1996), and MORPHOLOGI- 
CAL approaches which represent it as a property of morphs (Chomsky 1964; Chomsky & 
Halle 1968; Zonneveld 1978; Pater 2000). The result is an argument in favor of a DIACRITIC 
PHONOLOGICAL approach. On this account, exceptionality is represented at the level of 
individual phonological elements, not morphs; however the means of marking it is by 
diacritics which are visible to the phonology but not manipulable by it, in contradistinc- 
tion to the CONCRETE PHONOLOGICAL approach, where the crucial representations are 
themselves phonological elements. As I show, the function of these “®-indices” is essen- 
tially identical to “M-indices” which would mark morphs, only there is no assumption 
that all exponents of a morph m be indexed identically. As we shall see, freedom from 
that assumption is both coherent theoretically and desirable, computationally and em- 
pirically. The discussion is illustrated throughout by the facts of word final deletion in 
Yidiny, to which we turn now in §2. 


2 Word-final deletion in Yidiny 


2.1 The phenomenon 


Yidiny (Dixon 1977a) belongs to the Yidinyic subgroup of the Pama-Nyungan language 
family. Traditionally it was spoken in the rainforest region southwest of Cairns, in North- 
eastern Australia. Most examples below are from Dixon’s (1977) detailed descriptive 
grammar; examples marked T are from Dixon’s (1991) dictionary and texts. An inventory 
of underlying segments is in Table 1. 


Table 1: Yidiny underlying segments, after Dixon (1977a: 32). 


Labial Apical Laminal Dorsal 


Stop b d j g 
Nasal m n n I 
Lateral, trill Lr 

Approximant w l y 

Vowels i, a, u, it, a:, u: 


Syllable shapes are tightly constrained. Onsets are obligatory and simple. Codas 
permit only sonorants other than /w/. Codas in word-final position are simple; word- 
internal codas also permit disegmental, continuant-nasal sequences. Morphologically, 
the language is almost entirely suffixing and largely agglutinative. Roots are minimally 
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disyllabic and suffixes are maximally disyllabic (Dixon 1977a: 35,90). An online ap- 
pendix? discusses the morphological constituency of verbal inflection. 

Of Yidiny’s phonological alternations, those to receive the greatest attention have 
been stress placement, vowel length and to a lesser extent, word-final deletion (Dixon 
1977a,b; Hayes 1982; 1985; Kager 1993; Crowhurst & Hewitt 1995; Halle & Idsardi 1995; 
Hall 2001; Pruitt 2010; Hyde 2012; Bowern, Round & Alpher in revision, inter alia). 
Yidiny’s stress and length alternations in particular have featured in significant theo- 
retical works on meter and prosody over the past four decades, and both are nontrivial 
topics in themselves. Word-final deletion, however, can be studied largely independently 
of them for reasons that follow. 

Although stress placement in Yidiny has proven contentious (Pruitt 2010; Bowern, 
Round & Alpher in revision), word-final deletion is not sensitive to stress per se, but 
rather only to the position of foot boundaries. These have been uncontroversial since 
their analysis by Hayes (1982): feet in Yidiny are disyllabic and left-aligned within the 
phonological word. 

Many words with word-final deletion also exhibit vowel lengthening; however the 
phenomena show little to no mutual interaction. In a rule-based theory permitting si- 
multaneous application (Anderson 1974) lengthening and deletion would apply simulta- 
neously; neither rule feeds or bleeds the other? See Round (in progress) for an analysis 
of Yidiny lengthening. 

Word-final deletion is sensitive to foot placement, and foot placement is sensitive 
to phonological word boundaries. In Yidiny, phonological words commence at the left 
edge of each root and each disyllabic suffix (Dixon 1977a: 88-98).* Phonological words 
therefore begin with either a polysyllabic root or a disyllabic suffix and are followed by 
zero or more monosyllabic or entirely consonantal suffixes. Word-final deletion targets 
unfooted syllables and therefore only affects prosodic words which, modulo deletion, 
would be at least trisyllabic. As a consequence, we are interested here in three kinds 
of phonological word: those comprised of bare roots of three or more syllables; those 
comprised of roots plus one or more monosyllabic suffixes; and those comprised of a 
disyllabic suffix plus one or more additional, monosyllabic suffixes. The third kind is 
rare, and so discussion will focus on the first two. 

Word-final deletion applies only if the word thereby avoids surfacing with an unfooted 
syllable. For example, the roots gindanu- ‘moon’ and gubuma- ‘black pine’ both contain 
three vowels, each of which is a potential syllabic nucleus at the surface. In (3a,4a) they 
have undergone deletion of their final vowel to prevent it from surfacing in an unfooted 
syllable; compare (3b,4b) where the roots are non-final in the word, and the final vowels 
surface. 


2 Available from 10.6084/m9. figshare.4579696 

3 Deletion counter-bleeds lengthening, thus in a strictly serial analysis lengthening would precede word- 
final deletion (Dixon 1977a,b; Hayes 1985; Crowhurst & Hewitt 1995). 

4 Yidiny’s only prefix, [ja:-] ‘in a direction’ occupies its own phonological word (Dixon 1977a: 98,162). 

5 For an illustration, see example (25). 
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(3) a. 'moon[Ass| /gindanu/ — (gin da:n) 
* (gin da:) nu 
b. ‘moon-ERG’  /gindanu-ggu/ — (gin da) (nun gu) 


(4) a. ‘black pine[aBs]’ /gubuma/ — (gu bu:m) 
* (gu bu:) ma 
b. ‘black pine-purP’ /gubuma-gu/ — (gu bu) (ma gu) 


Final vowel deletion may also affect suffixes. In (5a,c,6a), the vowels of the nominal 
comitative suffix -yi and verbal comitative suffix -ņa have undergone deletion, thereby 
preventing the surfacing of an unfooted syllable. In (5b,6b) the suffixes are non-final in 
the word, and the vowel surfaces. 


(5) a. “woman-com’ /buna-yi/ — (bu nary) 
* (bu na:) yi 
b. ‘woman-com-ERG’ /buna-yi-ngu/ — (buna) (yin gu) 
c. ‘black bream-com’ /gulugulu-yi/ — (gu lu) (gu lu:y) 
*(gu lu) (gu lu:) yi 
(6) a. ‘come-com[imp]’  /gada-ga/ — (ga da:n) 
*(ga da:) ga 


b. 'come-cow-PsT /gada-na-Inu/ — (ga da:) (yal nu) 


Word-final deletion interacts with restrictions on word-final consonants, and the in- 
teraction plays out differently in roots versus suffixes. In roots, deletion will fail to apply 
if the result would be an illicit word-final coda, containing either a stop or /w/ (7) or a 
cluster (8). One conceivable alternative, to also delete the consonant, is not attested in 
roots (7-8). 


(7) a. ‘man[ass]’ /waguja/  — (wagu:) ja 

* (wa guy) 
* (wa gu:) 

b. ‘dog[ass]’ /gudaga/ | — (guda:) ga 
“(gu da:g) 
* (gu da:) 

c. ‘sugar ant[ABs]’ /balawa/ — (bala:) wa 
*(ba la:w) 
* (ba la:) 

d. ‘place name[ABs]  /galumba/ — ` ma lu:m) ba 
*(ga lu:mb) 
*(ga lu:m) 


6 Neither Dixon’s grammar (1977) nor dictionary (Dixon 1991, which cites underlying forms) records a surface 
form for the roots in (7c) and (7d), or for roots illustrating the same pre-final consonant or comparable 
consonant clusters. However, Dixon (1977a: 57-58) specifically reports that the roots balawa- and gindalba- 
do not undergo deletion; the surface forms provided here are what we would expect if this is so. 
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(8) ‘warn[imp]’ /binarna/ — (bina:r) ya 
* (bi na:rn) 
*(bi na:r) 


In contrast, deletion in suffixes applies not only to the final vowel, but also to a sin- 
gle consonant that precedes it, if that consonant would be illicit word-finally, as in (9). 
This form of CV deletion respects phonotactic constraints while also avoiding unfooted 


syllables.’ 


(9) a. ‘grey possum-ERG’ /margu-ngu/ — (mar gun) 
b. ‘see-PsT’ /wawa-lnu/ — (wa wa) 
c. warn-DAT.SUB' /binarga-Inu-nda/ —» (binar) (yal nun) 


However, word-final deletion never deletes the initial segment of a suffix (and conse- 
quently, it will never delete an entire suffix), as illustrated in (10). 


(10 a. ^woman-sET' /buna-ba/ —  (buna:) ba 
*(bun ba) 
b. ‘bandicoot-cEn’ /guygal-ni/  — (guy ga:l) ni 
"(guy ga:ln) 
"(guy gail) 


Deletions do not occur word internally (11a,b), nor do word-final, licit codas delete (11b). 
All Yidiny roots and suffixes that are consonant-final end underlyingly with licit coda 
consonants, so no morph undergoes spontaneous deletion of an underlyingly-final con- 
sonant (11c). 


(11) a.  ‘woman-sET’ /bupa-ba/ —  (bupa:b) 
* (bu pa:) 
b.t ‘name[ass]’ /bagiram/ — (ba gi:) ram 
*(ba gi:rm) 
* (ba gi:r) 
c. "lbagirag/ | — "(ba gi:) 


To summarize, word-final deletion applies only so as to avoid the surfacing of unfooted 
syllables. It may delete the final vowel from a root and the final (C)V sequence from 
a suffix, but will not delete a suffix-initial segment. Deletion is blocked (in roots) or 
expanded (in suffixes, from V deletion to CV deletion) in order to obey phonotactic re- 
strictions on word-final codas. These are the regular conditions under which word-final 
deletion occurs. 

In addition to its regular application, Yidiny contains roots and suffixes which are 
exceptional non-undergoers of word-final deletion. In (13), the non-undergoer roots 


7 The *dative subordinate" is marked by what Round (2013: 26) has called *compound suffixation", comprising 
two monosyllabic suffixal morphs, /-lnu; -nda/. That these are not a single, disyllabic suffix is evident in 
the fact that they fail to be parsed into a their own phonological word, separate from the root. 
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mulari-, guyara-, yudulu-, bangamu- all resist word-final deletion despite their pre-final 
consonant being permissible as a coda, and despite the fact that the consequence is an 
unfooted, word-final syllable. 


(12) a. ‘initiated man[ABs]  /mulari/ —  (mula:)ri 
* (mu la:r) 
b. "broom[Ass]' /guyara/ — (gu ga:) ra 
* (gu jar) 
c. ‘brown pigeon[ass]’ /sudulu/ — (ju du:) lu 
* (yu du:l) 
d. potato[ABs]' /bangamu/ — (bag ga:) mu 
"(ban ga:m) 


Dixon (1977a: 59) reports 115 trisyllabic roots whose phonotactic shape would, under 
regular conditions, expose them to word-final deletion. Of these, 34, or around 30%, 
are exceptional non-undergoers. The distinction is idiosyncratic; neither Dixon (1977a: 
58) nor subsequent researchers have found any phonological, semantic or grammatical 
factor that categorically determines whether a root will be a non-undergoer.® 

Suffixes also may be exceptional non-undergoers. In (17) the non-undergoer suffixes 
-nda, Jet and -na resist word-final deletion and allow an unfooted syllable to surface. 
Avoidance of regular, word-final CV deletion is seen in (13a,b) and V deletion in (13c). 


(13) a. ‘grey possum-bAT' margu-nda — (mar gu:n) da 
* (mar gu:n) 


b. ‘see-LEST[ABS]’ wawa-lji — (wa wail) gi 
* (wa wa:l) 

c. 'go-PURP' gali-na —  (gali)na 
* (ga li:n) 


Tables 2 and 3 list all suffixal allomorphs in Yidiny which, on phonotactic grounds, could 
plausibly delete.? Regular undergoers are in Table 2 and non-undergoers in Table 3. 


5 Historically speaking, borrowed forms may account for many of these items (Barry Alpher p.c.); synchron- 
ically, however, their motivation is opaque. 

? Such suffixes must be vowel-final and monosyllabic. If just the final vowel is to delete, then it must leave 
behind a single, licit-coda consonant in word final position. This will require the suffix to be -CV, and be 
preceded by a vowel, not a consonant. Alternatively, if the final CV is to delete, then the suffix must be 
-CCV, since suffix-initial segments do not delete, and it too must attach to a vowel-final stem. Data here is 
from a comprehensive search of Dixon (19772), in which relevant information can be found on pp.50-54, 
151. “Emphatic” -pna (Dixon 19772: 151) is excluded. It behaves as a phonological clitic that occupies a distinct 
phonological word, and does not undergo final deletion. 
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Table 2: Monosyllabic suffixes which undergo word-final deletion. 


Function -CV -CCV 
Case ERGATIVE -ngu 
LOCATIVE -la 
ACCUSATIVE -na 
COMITATIVE -yi 
GENITIVE -ni, -nu 
Verbal PAST tense inflection -nu -]nu, mu 
COMITATIVE derivation -ya 
DATIVE SUBORDINATE inflection -nda!? 


Table 3: Monosyllabic suffixes which escape word-final deletion. 


Function -CV -CCV 

Case DATIVE -nda 

Verbal PURPOSIVE inflection -na -lna, ma 
LEST nominalizing derivation -nji, -]ji, 219 


Exceptional non-undergoers, both roots and suffixes, only block the deletion of their 
own segments; the exceptionality does not spread to neighboring morphs. Accordingly 
n (14), the exceptional non-undergoer Lest does not block deletion in the following, 
regular undergoer, ERGATIVE suffix. 


(14) wiwi-ji-nd-jgu  'give-ANTIP-LEST-ERG' — (wi wi:) (jin ji:n) 
"(wi wi:) (jin ji: gu 


Likewise, the presence of a regular undergoer will not undo the blocking effect of an ex- 
ceptional non-undergoer. In (15) the regular undergoer COMITATIVE does not undermine 
the blocking of deletion in the exceptional non-undergoer PURPOSIVE, which follows it. 


(15) majinda-ga-lna ‘walk up-com-puRP’ — (maşin) (da ga:l) na 
"(ma jin) (da nal 


It will be recalled that roots in Yidiny can undergo word-final deletion of vowels, but 
not of the consonants that precede them. More specifically, roots that end in CCV do 
not delete final CV, whereas some suffixes do, and nor does final C'V delete from roots 
that end in VC'V, where C is an impermissible coda. Two conceivable accounts for this 


10 The dative subordinate is marked by a string of two monosyllabic suffixes -Ipu-nda, cf. fn.7. 
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may be distinguished. On one account, the grammar of Yidiny expressly prohibits root- 
final CV deletion. On the other, it happens just by chance that all CCV-final and VC'V- 
final roots are exceptional non-undergoers. On the latter account, the grammar WOULD 
enforce CV deletion from roots, if only the lexicon provided the right inputs; on the 
former account it would not. The level of empirical support for these hypotheses can be 
assessed statistically. Table 4 compares counts of CCV- and VC V-final roots and CCV- 
final suffixes which either do or do not delete. The distribution is strongly unbalanced, 
and we can reject with confidence the null hypothesis that it is due to chance (Dor 
= 479 p < 10?). Table 5 compares counts of roots that are CCV- and VC'V-final with 
those that are VCV-final, i.e., where C is a permissible coda. Again, the counts are highly 
unbalanced and we reject the hypothesis that the absence of deletion in CCV- and VC'V- 
finals is by chance (x?ari = 125.8. p < 10). The only empirically-supported conclusion 
is that the lack of consonant deletion in Yidiny roots is systematic, not due to chance. A 
satisfactory formal analysis should reflect this." 


Table 4: Deletion of coda-ilicit pre-final C in roots versus suffixes. 


CCV- and VC V-final roots CCV-final suffixes 


No deletion 116 6 
Deletion 0 4 


Table 5: Deletion in roots with pre-final coda-illicit C versus prefixal coda-licit 


C. 


CCV- and VC V-final roots VCV-final suffixes 


No deletion 116 34 
Deletion 0 81 


2.2 Constraint rankings 


A briefly sketch now follows of how the facts above would be analysed in OT. Foot place- 
ment in Yidiny is due to FooTBINARITY >> PARSESYLLABLE > 

ALIGN(FT,L,PRWD,L) (Prince & Smolensky 2004[1993], McCarthy & Prince 1993a, Mc- 
Carthy & Prince 1995). Of these, only PARSESYLLABLE (Pns) will be of interest for our 
purposes; I assume that other prosodic constraints are satisfied optimally. Absolute re- 


11 As a reviewer observes, there is an interesting historical background to be clarified here, and an account 
of it is planned. Naturally, the object of a synchronic analysis differs ontologically from that of a historical 
one. The two are complementary, but neither account would substitute for or serve as a counter-analysis 
to the other. 
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strictions against obstruents and /w/ in codas are due to SONORANT/CopaA (e.g. Lombardi 
2002) and *w/Copa; I assume these are unviolated. 

Regular word-final deletion in Yidiny can be analysed straightforwardly by ranking 
PRs >> MAxIMALITy (Max, McCarthy & Prince 1995). This causes deletion of final vowels 
in preference to the surfacing of unfooted syllables, but not if an illicit coda results. 

Segments may delete from the right edge of the word only, not the left or word- 
internally. High-ranking ANcHoR-LEFT(morph) penalizes deletion from the left edge 
of any morph and Contic-IO(PRWD) penalizes deletion internally (McCarthy & Prince 
1995). 

Yidiny permits complex codas word-internally, but not word-finally. Ranking CNTG 
>> *CoMPLEXCODA (Bernhardt & Stemberger 1998) accounts for this; ranking both above 
Pns accounts for the absence of deletion after pre-final clusters in roots and the defeat 
of candidates which delete only a final vowel from word-final CCV suffixes. 

Word-final deletion applies differently to roots and suffixes. Roots will not undergo 
consonant deletion, even if the consequence is an unfooted syllable. The ranking of un- 
dominated MAx-C/noor (McCarthy & Prince 1995) above Prs accounts for this. Suffixes 
do not violate MAx-C/nr and consequently are free to undergo consonant deletion, how- 
ever highly-ranked Anc penalizes the deletion of morph-initial segments. This accounts 
for the fact that a consonant may delete from a -CCV suffix but not from -CV. 

At this point, regular word-final deletion occurs whenever satisfaction of the marked- 
ness constraint Pns requires the violation of the lower-ranked faithfulness constraint 
Max. Deletion is blocked unexceptionally whenever Prs itself is violated in order to sat- 
isfy higher-ranking constraints, which are of two kinds: those which penalize marked 
codas, soN/ConA, *w/Copa, *Crrx; and those which penalize deletion in specific mor- 
phological contexts, namely at left edges of morphs, ANc, and consonants in roots, MAx- 
C/RT. We see that the driver of word-final deletion in Yidiny is the ranking of Prs > 
Max. Deletion occurs when Prs is satisfied but Max is not. Regular blocking results 
when Prs must be violated, in which case Max can be satisfied. 

Exceptional non-undergoers avoid deletion. For them, Max is always satisfied, even 
at the expense of Pns. Consequently, while regular undergoers are subject to a ranking 
of Prs >> Max, exceptional non-undergoers must be subject to Max > Prs. In §4 I 
consider two approaches that will ensure this is the case, one morphological and one 
phonological. First though, a remark about constraint violations. 


3 Relativized constraint violation 


I introduce here a simple expression for relating the violations of certain pairs of con- 
straints, which will aid discussion in later sections. 

For any constraint C and candidate cAND, there will be zero or more violations of C. 
Given the definition of C, those violations will be due to certain parts, or loci, in CAND, ei- 
ther in the output of cann or in the correspondences between input and output elements 
(McCarthy & Prince 1995). We can define the set of LocI or VIOLATION, V(C, CAND), as 
the loci in cAND which cause violations of C (McCarthy 2003, Lubowicz 2005). Now, 
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some pairs of constraints C4, C? are related such that for any CAND, the loci of violation 
of C; are a subset of the loci of violation of C4. In many cases, the latter are precisely 
those members of the former which also contain some particular kind of phonological 
element. For example V(MAx-C, CAND) are those members of V(Max, CAND) which also 
contain input consonants. In that case, we can express V(C2, CAND) terms of the INTER- 
SECTION of the set V(C;, CAND) and some appropriately defined second set, that picks out 
loci containing the criterial elements. Let us define the set of “g-loci”, Ly(D(¢), CAND), 
as the set of loci in CAND that contain a phonological element o of the kind denoted by 
predicate D(g). For example, V(MAx-C, CAND) can be defined in relative terms, as in (16), 
where the predicate INPUT_CONSONANT(@) denotes input consonants. (For brevity I omit 
the "cAND" from the expression for each set.) 


(16) V(MaAx-C) =4ef VMAX) N L;(iNPUT. CONSONANT(Q)) 


This relativized method will be used below to define new constraints Cy in terms of a 
reference constraint, Cg, and a set of phonological elements which restrict the violations 
of Cy relative to those of Cg. 


4 Preliminary analysis of word-final deletion 


4.1 A morphological approach 


We now consider an OT implementation of the morphological approach to Yidiny ex- 
ceptionality, using lexically indexed constraints (Pater 2000; 2006; 2009). A lexically 
indexed constraint Cu behaves precisely like its unindexed counterpart, C, except that it 
can be violated only by structures which contain exponents of a specific set M of morphs, 
each of which has been assigned a diacritic mark which I will term a LEXICAL M-INDEx, 
that co-indexes it to Cy. The definition can be expressed relatively as in (17), following 
a similar formulation by Finley (2010). 


(17) V(Cm) =aer V(C) à Lo (meM E Exp(e, m)), where: 
M is the set of morphs co-indexed to Cy. 
Ex»(q, m) states that element o is an exponent of morph m 


If we now define two sets of Yidiny morphs, U the set of regular undergoers of word- 
final deletion, and N the set of exceptional non-undergoers, then either of the rankings 
in (18) will ensure that the correct sets of morphs is subject to the desired partial ranking 
of Prs and Max. 


(18) a. Prsy > Max > PRS 
b. Maxw > Prs > Max 


In (18a), all phonological exponents of undergoer morphs will be subject to PRsu > 
Max, and non-undergoers to MAx > Pns. In (18b), all phonological exponents of non- 
undergoer morphs will be subject to MAxw > PRs, and undergoers to Prs > Max. For 
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now I will use ranking (18a); the reason for this will become clear in $5.7? Examples in 
(19a-19b) illustrate word-final deletion of regular undergoers which are indexed U, the 
root malanu-y and suffix ERGATIVE -nguy, while (19c-19d) show the absence of deletion 
for exceptional non-undergoers mulari- ‘initiated man’ and DATIVE nda. 


(19) Prsy | Max | Prs 


a. /malanuy/ ‘right hand[ass]’ w L Ww 


(ma la:n) > (ma la:) nu 


b. /margu-gguu/ ‘grey possum-ERG’ 
(mar gu:n) > (mar gun) gu 
c. /mulari/ ‘initiated man[Ans]' 


W L 
(mu la:) ri > (mu la:r) 
d. /margu-nda/ 'grey possum-DAT' Ww L 
(mar gu:n) da > (mar gu:n) 
e. /majinda-gapy-Ina/ ‘walk up-cow-PunP' w L 


(ma jin) (da ya:l) na > (ma jin) (da nah 


Example (19e) illustrates the fact that violations of PRsy require not merely the pres- 
ence of a U-indexed morph in the word, but a locus of violation which contains a phono- 
logical exponent of a U-indexed morph (17). Namely, the final syllable of (19e), na, is un- 
footed. However since that syllable contains no phonological exponent of a U-indexed 
morph, no violation of Prsy results. This is true despite the presence of a U-indexed 
morph elsewhere in the word. 


4.2 A phonological approach 


The phonological approach correlates the (un)exceptionality of a segment with represen- 
tational properties of the segment itself. Implementations differ as to which property is 
used. Zoll (1996) analyses segments which resist deletion as having root nodes in their 
input, whereas segments that delete more readily lack root nodes, and are termed suB- 
SEGMENTS. Under these assumptions, a ranking Max(SEG) >> Prs >> MaAx(SuBSEG) en- 
sures that segments with input root nodes are subjected to Max(SEG) > Prs, while those 
without are subjected to Prs > Max(Susssc).'4 Examples are in (20), where segments 
without root nodes are underlined. 


12 Briefly, procedures for learning OT grammars improve in performance if they opt to rank markedness 
higher than faithfulness when given a choice. Consequently the ranking in (18a) will be learned in prefer- 
ence to (18b); see $5. 

D An early proposal that only faithfulness constraints be indexable (Benua 1997; Itó & Mester 1999; Fukazawa 
1999) has proven untenable (Pater 2000; 2006; Flack 2007b; Flack 2007a; Inkelas & Zoll 2007; Gouskova 
2007; Mahanta 2008; Jurgec 2010). 

14 Assuming undominated *FLoat (Myers 1997), which prohibits surface subsegments, and low-ranked 
Dep(Root) (Zoll 2001). 
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(20) Max(SEc) | Prs | Max(SuBsEc) 

a. /malanu/ " L 
(ma la:n) > (mu la:) nu 

b. 
/margurgu/ w L 
(mar gu:n) > (mar guy) gu 

c. /mulari/ 

W L W 

(mu la:) ri > (mu la:r) 

d. - 
/margu-nda/ " L " 
(mar gu:n) da > (mar gu:n) 


I wish to draw a distinction now between two conceivable kinds of phonological anal- 
ysis. A CONCRETE phonological analysis represents exceptionality using regular phono- 
logical material, such as features, root nodes and prosodic units, or perhaps their ab- 
sence. An ABSTRACT phonological analysis uses diacritic lexical indices, which I will 
term LEXICAL ®-INDICES, on segments, much like the morphological analysis uses lexical 
M-indices on morphs. Some objections which have been raised to phonological analyses 
are specific to the concrete approach. These include doubts over whether sufficiently 
many concrete phonological contrasts would be available in languages with very many 
exceptional patterns (Gouskova 2012), and concerns over whether learners can choose 
between multiple, alternative concrete representations (Kiparsky 1973, Pater 2009). I 
will set these concrete-specific concerns aside for now, and instead assume an abstract 
phonological approach. I return to the concrete approach in $9, where I argue on inde- 
pendent grounds that it is poorly adapted to efficient learning. 

Accordingly, I will use lexical ®-indices u and nto index undergoer and non-undergoer 
segments respectively, and define ®-indexed constraints, Co, in relative terms as in (21). 


(21) V(Co) =dep V(C) NA Lo (oe), where: 
9 is the set of phonological elements co-indexed to Co. 


Returning to the phonological account of Yidiny exceptionality, a constraint ranking 
Max-n > Prs > Max, or Pns-u > Max > Prs, will be sufficient for our purposes. 
Tableau (22) shows examples using the latter ranking; u-indexed segments are under- 
lined. 


(22) Prs-u | Max | Prs 
a. (mala:n) > (mu la:) nu W L W 
b. (mar gu:j) > (mar guy) gu W L 
c. (mula:)ri > (mu la:r) W L 
d. (mar gu:n) da > (mar gu:n) W L 
e. (ma jin) (da ga:l) na > (ma jin) (da na:l) W L 
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A recent criticism of the phonological approach to exceptionality in OT is that it over- 
generates (Gouskova 2012). Adapting Gouskova’s arguments to the facts of Yidiny: if 
we adopt the ranking Prs-u > Max > PRs, then it is no longer necessary to assign a 
high ranking to the morphologically-sensitive constraints Anc and MAx-C/Rr, which 
penalize the deletion of morph-initial segments and root consonants. Rather, so long as 
all morph-initial segments and all root consonants lack a lexical u-index, then by virtue 
of the partial ranking Max > Prs, they will resist deletion irrespective of the ranking of 
Anc and Max-C/RT. By the same token however, if Anc and MAx-C/Rr do receive a low 
ranking, then the analysis will fare poorly in the context of Richness of the Base (Prince 
& Smolensky 2004-1993]), since without high-ranked Anc and Max-C/RT ensuring that 
morph-initial and root-consonant deletion is impossible, there is nothing to prevent seg- 
ments from deleting in those positions if they are u-indexed in the lexicon. For example, 
a root such as *binarna could undergo CV deletion; a suffix *-ni could delete entirely; 
and *mulari could delete from the left, thereby failing to capture the generalization that 
the absence of such forms is not an accident of the lexicon, but a systematic property 
of the grammar. This is perhaps the most significant apparent flaw of the phonological 
approach: it fails to rule out unattested patterns. This is in contrast to the morphological 
approach, which does rule them out. Or at least, so it would seem. In 85 I show that the 
true situation can be otherwise, once learning is taken into account. 


4.3 Alternatives 


Before proceeding to learning, I mention two OT alternatives to the analysis of excep- 
tionality in Yidiny word-final deletion. 

Co-phonological approaches handle exceptionality as a type of cyclicity effect (Orgun 
1996, Kiparsky 2000, Inkelas & Zoll 2007, Bermüdez-Otero 2016). On each morphological 
cycle the result of a morphological operation is submitted to an appropriate phonological 
subgrammar, of which the language may possess many. Problematic for any cyclicity- 
based approach to exceptionality in Yidiny word-final deletion is that the Yidiny case is 
non-cyclic. Instead, undergoers are subject to deletion only if word-final. For example, 
in building both words in (23a,b) the first step would be to introduce the undergoer 
root bigunu- ‘shield’. However at that point, the “deleting” subgrammar should only be 
applied if the root will end up word final, as in (23a) but not in (23b). 


(23) a. ‘shield[aBs]’ /bigunu/ — (bi gu:n) 
b. ‘shield-comit-ERG’ /bigunu-yi-ngu/ — (bigu) (nu yi:n) 
"(bi gun) (yin gu) 


Selecting the correct subgrammar in (23) thus requires information about the next 
step in the derivation. Crucially though, it requires forewarning, not only of whether or 
not there is more morphology to come, but also of what the PHONOLOGICAL ramifications 
will be. This is because the relevant domain for word final-deletion in Yidiny is not the 
morphological word but the prosodic word. For example, in (24) the roots gayula- ‘dirty’ 
and gumai- ‘red’ are followed by suffixes. Since the suffixes are monosyllabic, just one 
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prosodic word results and the roots are non-final in their prosodic word. In (25) however, 
the roots are followed by the disyllabic INcHOATIVE suffix daga, which commences a 
second prosodic word. As a consequence, the roots are final in their prosodic word and 
deletion is possible: the undergoer gajula- deletes while the non-undergoer gumayi- does 
not. 


(24) a.  'dirty-cAus-PsT' 
/gayula-na-Inw/) — — ` [(ga yu) (la pa:I)pwa] 
b. 'red-cAus-PsT' 
/gumaga-pu/  — ` Heu ma) Qi na‘l)pwal 
(25) a. ‘dirty-INCHO-PsT’ 
/gajyula-daga-nu/ — — [(ga ju:l)pwa] Ida ga:n)pwal] 
b. 'red-iNCHO-PsT' 
/gumayi-daga-nu/ — [(gu ma:) {ipwa] [(da ga:n)pwa] 


Any cyclic, look-ahead mechanism in Yidiny would therefore need to know how the 
word would be prosodically parsed on the NExT cycle, before it can decide whether or 
not to apply the “deleting” subgrammar on the current cycle. The look-ahead mecha- 
nism would therefore require the power of a subgrammar itself, yet if the theory were 
augmented in this manner, then other core mechanisms such as scope, or “bracket era- 
sure”, effects (Inkelas & Zoll 2007) would be undermined. I conclude that co-phonology 
theory as it stands cannot analyse exceptionality in Yidiny word-final deletion. 

Another approach would be to lexically list two allomorphs for all undergoer morphs 
in the language, and have the grammar select them either optimally (Mester 1994, Kager 
1996, Mascaro 1996, and Tranel 1996a,b) or with some degree of stipulation (Bonet, Lloret 
& Mascaró 2007; Round 2013; Wolf 2015). On this approach, “deletion” is apparent only, 
due in reality to the selection between two input allomorphs, one of which contains only 
a subset of the segments in the other (for a proposal not unlike this for Yidiny, see Hayes 
1997). An example is shown in (26), where the grammar optimally selects between two 
input allomorphs of the undergoer root bigunu- ‘shield’. 


(26) | (/bigunu/, /bigun/} ‘shield[aBs]’ | Anc | Max-C/RT | Prs | Max 
a. pe /bigun/ :: (bi gu:n) | 
b. /bigun/ :: (bi gu:) nu | xW 
c. /bigunu/ :: (bi gu:n) | *W 
d. /bigunu/ :: (bi gu:) nu | xW 


Two objections can be raised. First, because the approach simply lists alternant pairs, it 
misrepresents their resemblances as accidents, rather than relating them systematically. 
Relatedly, in the context of Richness of the Base, the analysis would allow the apparent 
deletion of morph-initial and -medial segments as well as root consonants, by leaving 
them out of an underlying allomorph, in a pair such as {/bigunu/, /gunu/}. Ranking ANc 
and Max-C/noor highly would not ameliorate the problem, as shown in (27). 
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(27) {/bigunu/, /gunu/} Anc Max-C/roor | Prs | Max 


a. 9"* /gunu/ :: (gu nu) 


b. /bigunu/ :: (bi gu:) nu xW 


Second, it is unclear how the analysis would prevent apparent deletion in word medial 
positions in the event that it is optimsing, as in (28), where the true output bupala-ņa:-lna 
violates Prs while the more optimal false winner *bujal-na-Ina does not. The constraint 
CNTG will not prevent this occurring. 


(28) +{/bugala, buyal/}-rna-Ina/ Es ve 
= n 
‘finely ground-cAUsE-PURP' 59l i 
a. "€  /bujala-ga-Ina/ :: (bu ja) (la ga:l) na *L 
* ` /bugal-na-Ina/ :: (bu gal) (gal na) 


I conclude that neither the co-phonological approach nor the allomorph-selection ap- 
proach offers a viable alternative for Yidiny word-final deletion. 


5 Learning exceptionality 


5.1 Biased Constraint Demotion 


I turn now to consider how exceptionality is, or isn’t, learned. After introducing Prince 
and Tesar’s (2004) Biased Constraint Demotion (BCD) algorithm and adaptations of it 
for the learning of indexed constraints, I show that the learning of Yidiny word-final 
deletion does not proceed as one might expect from the discussion in §4. A solution is 
then offered in §6. 

Prince and Tesar’s BCD is a computationally efficient algorithm for the learning of 
OT grammars. It builds upon Tesar’s earlier Recursive Constraint Demotion (RCD) al- 
gorithm (Tesar 1995, Tesar & Smolensky 2000), deterministically learning a grammar, 
conditional on the data, by ranking constraints in a series of steps, or recursions. At the 
first step, one or more constraints is assigned to the highest-ranked CONSTRAINT STRA- 
TUM in the grammar. A stratum is a set of constraints whose relative ranking against 
one another is indeterminate given the data, but whose ranking relative to constraints 
in other strata is significant. The act of assigning constraints to a stratum is termed IN- 
STALLATION. At each subsequent step, one or more additional constraints are installed in 
the next-highest stratum, and so on, until all constraints are ranked. The determination 
of which constraint(s) are installed next is based on evidence from winner-loser pairs 
(WLPs). For each WLP, any constraint yet to be installed will favor the winner in the 
pair, the loser, or neither. The full table of WLPs and constraints yet to be installed is 
termed the support. A fragment of a support is shown in (29). The relative order of 
constraints and WLPs in a support is inconsequential, though for ease of inspection I set 
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out markedness constraints to the left of a vertical double line, and faithfulness to the 


right. 


29 Z 3 Ee [o 
Ek 82|g 53 EIE 
&l&|l&v"(z|Oo d 
a. /margu-ni/ wlw L 
(mar gu:n) > (mar gu:) ni 
b. /guygal-ni/ 
SE L | L W W 
(guy ga:l) ni > (guy ga:l) 
c. /guygal-ni/ LILI wlw 
(guy ga:l) ni > (guy ga:ln) 
d. -ni 
/guygal-ni/ wWILIL 
(guy ga:l) ni > (guy ga:l ni) 
e. /bulmba/ 
L|LWI|W 
(bulm ba) > (bul ba) 


In the original RCD algorithm, the sole criterion for installing a constraint was that it 
favor no losers. This is true of the constraints FTBIN, CNTG and ANC in (29). When a con- 
straint, C, is installed, all of the WLPs for which C favors the winner are removed from 
the support, since the constraint ranking has now accounted for them. In the RCD, all 
constraints meeting this criterion at any recursion are installed, and the result at the end 
of all recursions is a correct grammar for the data. Nevertheless, the grammars inferred 
by the RCD are not optimal (Prince & Tesar 2004). The suboptimality relates to the subset 
problem (Baker 1979; Angluin 1980), a general problem in algorithmic learning from posi- 
tive evidence, namely that the system which results from learning will correctly assess as 
grammatical all attested items, but will fail to rule out certain systematically unattested 
items. This in turn relates to the notion of restrictiveness: a learning algorithm ought 
ideally to learn the most restrictive grammar consistent with the data. The RCD does not 
do this. In practice, meeting this desideratum is challenging for an efficient algorithm. 
However Prince & Tesar (2004) demonstrate that good headway can be made by enhanc- 
ing the RCD with a small set of biases, hence the name Biased Constraint Demotion, 
or BCD. The BCD differs from the RCD in two main respects. The first is the principle 
of FAITHFULNESS DELAY. According to this, at every recursion faithfulness constraints 
are not installed, even when they favor no losers, unless there are no other installable 
constraints. In (29) for example, the BCD would install the markedness constraint FrBiN 
but not the faithfulness constraints CNTG and Anc. If we do this, and remove from (29) 
all the WLPs for which FrBin favors the winner, namely (29d), and remove FrBiN, we 
have (30), in which only faithfulness constraints, CNTG and Anc favor no losers; under 
these conditions, faithfulness delay would permit the installation of CNTG and ANc. 
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(30) 

3 "S ie 
1 = bd E [9] 
n n Pu a z 
č £92 ó|s 

a. /margu-ni/ wlw L 

(mar gu:n) > (mar gu:) ni 
b. /guygal-ni/ LIL w " 


(guy ga:l) ni > (guy ga:l) 
c. /guygal-ni/ 

(guy ga:l) ni > (guy ga:ln) 
e. /bulmba/ 

(bulm ba) > (bul ba) 


L|LW|W 


However, there is a second principle to consider also. A principle of "freeing up 
markedness" states that when there is a choice between installing several faithfulness 
constraints, the algorithm should install the smallest subset possible, whose installment 
would cause a markedness constraint to become installable in the next recursion. For ex- 
ample, in (30), installing CNTG would remove WLP (30e), thereby freeing up the marked- 
ness constraint "Cprx at the next recursion; no comparable gain would flow from in- 
stalling Anc. On those grounds, from (30) the BCD would install CNTG. 


5.2 A support for learning Yidiny exceptionality 


Inow consider several learning scenarios for Yidiny exceptionality. Each begins directly 
after the installation of undominated constraints. Table 6 contains a set of WLPs that is 
representive of all combinations of roots and suffixes which are relevant to the grammar 
of word-final deletion: it is not the complete support, but it represents the complete 
support well. Segments which can delete are underlined. To economize on space below, 
WLPs will be referred to by the letters in the first column of Table 6. 


5.3 Learning the phonological account (preliminary version) 


We begin with the learning of the phonological account of Yidiny exceptionality de- 
scribed previously in 84.2. For the moment, I assume that input segments are already 
lexically ®-indexed as u or n. We begin after undominated constraints have been in- 
stalled, with a support as in (31). 


76 


4 Phonological exceptionality is localized to phonological elements 


Table 6: Support for learning Yidiny exceptionality. 


a. /margu-ni/ (mar gu:n) >(mar gu:) ni 
b. ` /guygal-ni/ (guy ga:l) ni >(guy ga:l) 
c. Jguygalni/ (guy ga:l) ni >(guy ga:ln) 
d.  /margu-ggu/ (mar gu:n) >(mar gu:9) gu 
e. /margu-ngu/ (mar gun) >(mar gu:gg) 
f  /bigunu-yi-ngu/ (bi gu) (nu yix) >(bi gun) (yin gu) 
g.  /wawa-lnu/ (wa wail) >(wa wall nu 
h. /gali-ga/ (ga li:j) >(ga li:) ga 
i. /gagara/ (ga ja:r) >(ga ja:) ra 
k.  /margu-nda/ (mar gu:n) da >(mar gu:n) 
L /wawa-lna/ (wa wa:l) na >(wa wa:l) 
m. /gali-na/ (ga li:) na >(ga li:n) 
n. /guyara/ (gu ja:) ra (gu ja:r) 
o.  /majinda-ga-lna/ (ma jin) (da ya:l) na 7 (ma in) (da ga:l) 
p. /bulmba/ (bulm ba) 7 (bul ba) 
(31) Prs-u | Prs | *Crrx || Max | Max-C | Cntc | ANC 
a, h, i. W W L 
b. L L W W W 
c. L L W W 
d, g. W L L 
e. W L L 
f. L L W 
j, k, 1, o. L W W 
m, n. L W 
p. L W W 


Support (31) does not contain any markedness constraints that favor no losers. Two 
faithfulness constraints favor no losers: CNTG, which would free up *Crrx if installed, 
and Anc, which would not free up any markedness constraints. Consequently, CNTG 
is installed next, removing WLPs (f) and (p) from the support. After that, the newly 
freed-up “CPLx is installed, removing WLPs (c) and (e), and leaving (32). 
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(32) Prs-u | Prs | Max | Max-C | Anc 
a, h, i. W W L 
b. L L W W 
d, g. W W L L 
j, k, l, o. L W W 
m, n. L W 


In 32 only Anc favors no losers, and so is installed. This removes (b), freeing up 
Pns-u, which is installed next, removing (a,h,i) and (d.,g), leaving (33). From (33), Max 
will be installed since it frees up Prs. This leaves Prs and Max-C, which according to 
faithfulness delay, will be ranked last as Prs > Max-C, as in (34). 


(33) Prs | Max | Max-C 
j, k, 1, o. L W W 
m, n. L W 


(34) CNTG 'CPLx > ANC  Pns-u > Max > Prs > Max-C 


Some comments are in order. First, the BCD algorithm has learned the key constraint 
ranking Prs-u >> Max > Prs responsible for the core of Yidiny exceptionality. Secondly 
however, it has also ranked ANc > Prs-u, in which case the learned grammar expressly 
prohibits morph-initial deletion. Indeed, had MAx-C/nr been included in (31), it would 
also have been ranked highly since it only ever favors winners, meaning the grammar 
would also expressly prohibit CV deletion in roots (the reasons for my excluding Max- 
C/RT are clarified in §6). This means that the algorithm is learning precisely the rankings 
required to prevent the phonological solution from overgenerating, thereby voiding the 
major criticism of the phonological approach which was introduced in $4.2. This is per- 
haps surprising, so why is the ranking learned? It is learned because the BCD algorithm 
attempts to construct a restrictive grammar. The typical assumption, that grammars 
implementing a phonological approach would not assign redundant, high rankings to 
constraints like ANc, is predicated on an implicit assumption that the learner would be 
seeking a PERMISSIVE grammar; doing so leads to overgeneration. However no success- 
fullearner would adopt that assumption, because successful learning in general requires 
a restrictive approach. For the theory of exceptionality, this is significant. It means 
the result obtained here, in which a phonological approach to exceptionality has been 
learned without overgeneration, is not dependent on some minor detail of the BCD, or 
the constraints used, or even OT. Rather, it follows from a general principle of learning. 
Consequently, the adoption of realistic assumptions about learning narrows the perfor- 
mance gap between the phonological and morphological approaches. I will examine the 
phonological approach further in $7.3. 
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5.4 Learning indexed constraints and the morphological analysis 


We consider next the learning of the morphological approach. The support begins, after 
installation of undominated constraints, as (35). These are the same constraints and 
WLPs as in the previous section, but without Prs-u. The support begins with no lexically 
indexed constraints; how they are learned is considered shortly. I also do not include 
Max-C/nr in the support. MAx-C/rT is essentially a variant of Max-C, indexed to all root 
morphs. This is the kind of constraint we might reasonably expect the morphological 
approach to learn. 


(35) Prs | *CPLX 


= 
e 
ei 


Max-C | CNTG | ANC 


sjsj Edad EHE m 


W W 


p. L 


Turning now to the BCD algorithm, neither of the markedness constraints in support 
(35) favors no losers. CNTG does, and would free up *CPrx. Anc also does, but would 
not free up any markedness constraints. Accordingly, CNTG is installed next, removing 
WLPSs (f) and (p) are from the support, and *Crrx after that, removing (c) and (e), leaving 
(36). 


(36) Prs | Max | Max-C | Ans 
a, h, i. W L 
b. L W W W 
d, g. W L L 
jklo. L W W 
m, n. L W 


ANC is installed next, removing WLP (b), which leaves (37), a support in which there 
is no constraint which favors no losers. 
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(37) Prs || Max | Max-C 
a, h, i. W L 
d, g. wi L L 
jk lo | L W W 
m, n. L W 


Supports in this state are said to have reached INCONSISTENCY. An inconsistency, how- 
ever, is not a failure. 

Inconsistencies indicate that the combination of data and assumptions currently under 
consideration have not led to a working grammar. Accordingly (assuming the data is 
correct), a revision of the assumptions is warranted. Suppose, in this case, that a revision 
could be made which leaves intact all previously installed constraints and their rankings, 
and the validity of all previously accounted-for WLPs, that is, a revision that would 
change only what is in the support. Suppose also that as a result of this revision the 
support came to contain a constraint that favors no losers. Such a revision would resolve 
the inconsistency. The BCD could restart and, one hopes, lead to a working grammar. 
Revisions that meet these criteria can be considered a type of learning. One such revision 
is to add a new, lexically M-indexed constraint to Con. 

Pater (2009) describes a method for learning M-indexed constraints and assigning co- 
indices to morphs, which takes a BCD inconsistency as its starting point. Coetzee (2009) 
extends this to Output-Output constraints, which I will not consider here. Becker’s mod- 
ifications (Becker 2009; Becker, Ketrez & Nevins 2011) are addressed in §8. 

Central to Pater’s method is the operation of CONSTRAINT CLONING, a process I de- 
scribe informally here and return to in detail in §8. Within the stalled support, a con- 
straint C is sought which, if it were indexed to some set M of morphs, would (i) favor at 
least one winner? and (ii) favor no losers. Assuming such a constraint C can be identi- 
fied, it is then cloned, which is to say, a lexically M-indexed version of it, Cy, is added 
to the support. Because Cu favors no losers, it is installed next. For example, support 
(38) is the same as (37) but now displays information about which morphs are involved. 
I have annotated relevant undergoers as U and non-undergoers as N. 

According to the criteria for cloning, all three of PRs, Max and MaxC are candidates 
for cloning (indexed to sets U, N and N respectively). I assume that owing to faithful- 
ness delay, markedness constraints are cloned in preference to faithfulness when both 
are available, in which case Pns will be cloned. In (39) the cloned, lexically M-indexed 
constraint PRsy is added to the support. Installing it removes WLPs (a,d,g,h,i) which 
frees up Max, whose installation is followed by Prs and Max-C. The resulting ranking 
is (40), which requires comment. 


15 The new constraint needs to favor at least one winner to have any chance of freeing up another constraint 
once it is installed. 
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(38) Prs | Max | Max-C 
a. /margu-niy/ W L 
d.  /margu-niu/ Ww L L 
g. /margu-nguy/ W L L 
h. = /gali-nay/ Ww L 
i /gajaray/ W L 
j  /binarya/ L W W 
k. /margu-nday/ L W W 
L. /wawa-lnay/ L W W 
m. /gali-na/ L W 
n. /gujgaraw/ L W 
o.  /majinda-ga-lnaw/ L W W 
(39) Prs-y | Prs || Max | Max-C 
a, h, i. W W L 
d, g. W W L L 
j, k, l, o. L W W 
m, n. L W 


(40) CNTG > *CPLx > ANC > Prsy > Max > Prs > Max-C 


The algorithm has successfully learned the key constraint ranking PRsy > Max > 
Prs. However, it did not create an indexed version of Max-C for roots, and thus has 
not learned to expressly prohibit CV deletion in roots. To be sure, no individual roots 
ending in CCV or VC’V (where C would be an illicit coda) will have been co-indexed to 
Prsy during the cloning operation (see §8 for details) and so none of those roots will be 
subject to CV deletion, however the ranking in (40) predicts that if the lexicon did contain 
a root such as “binarnay, then that root and any like it would undergo CV deletion. This 
is overgeneration of the same kind which was believed to beset phonological accounts. 
Thus, while §5.3 showed that grammars learned for the phonological account may suffer 
less than expected from overgeneration once learning is taken into consideration, §5.4 
shows that grammars for the morphological account may suffer from overgeneration 
more than expected. In the next section, I propose a solution. 


6 Morphological analytic bias: the Morphological 
Coherence Principle 


In §5.4 the grammar which was learned for a morphological analysis of Yidiny excep- 
tionality suffers from a manifestation of the subset problem. Although the algorithm 
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correctly handled all attested data, it did not learn the more restrictive generalization 
which applies also to unattested data, that roots in Yidiny do not undergo consonant 
deletion. The problem arises because the cloning procedure assesses morphology on a 
morph-by-morph basis only, whereas the true generalization in Yidiny applies to a class 
of morphs, in this instance, to roots. The remedy to be pursued here has two parts. It 
adds a new kind of constraint cloning, which indexes a constraint not to an idiosyncratic 
lexical list of morphs, but to a general class. It then biases constraint cloning so that class- 
indexed (or K-INDEXED) cloning is preferred over lexically indexed cloning. Effectively, 
this introduces an analytic bias (Moreton 2008) from morphology to phonological learn- 
ing at BCD inconsistencies. 

Now, supposing that the algorithm is seeking a constraint that it will clone and K- 
index to some non-idiosyncratic class of morphs, which classes should be available 
for the learner to consider? Important here is the fact that human phonological learn- 
ing will need to proceed in parallel with, and interleaved with, morphological learning 
(Tesar 2007, Merchant 2008: 6). Accordingly, I assume the learner has access both to 
universally-defined classes such as “root”, and those classes which have been morpho- 
logically learned, such as ERGATIVE CASE. The biasing principle, which I term the Mor- 
phological Coherence Principle is stated in (41), where criterion 2 provides an additional 
bias towards maximal restrictiveness. 


(41) The Morphological Coherence Principle: 

1 Ata BCD inconsistency, attempt to create a K-indexed constraint, 
co-indexed to some universal or learned morphological class K, before 
attempting to create a lexically-indexed constraint. 

2. If multiple constraints are eligible for K-indexation, select the one whose 
co-indexed class is most general. 


The MCP has some desirable theoretical properties. If the universal state of Cow at 
the commencement of learning is CON;nit, then the MCP obviates the need for CON ini to 
contain any constraints that are relativized to universal or learned morphological classes, 
since such constraints will be learned on demand, if and only if needed. In effect, this 
reduces the size of CON;nit without any change in the explanatory capacity of the theory. 
And, since it allows the grammar to BUILD constraints for language-specific morphologi- 
cal classes it makes those constraints available to the learner without problematically as- 
suming them universal (Russell 1995, Hammond 2000, see also Smith 2004, Flack 2007b). 
The MCP operationalizes, in a specific manner, the kind of insight into linguistic theory 
that Anderson (2008) argues ought to follow from an improved understanding of the 
learning device. 

Let us now return to Yidiny exceptionality, equipped with the MCP. Learning begins 
and proceeds as in §5.4 until the inconsistency in (38), at which point a constraint is 
sought for cloning. The MCP states that if possible, a constraint should be cloned and 
K-indexed. In (38) MAx-C would favor no losers if it were K-indexed to the entire class 
of roots, so it is cloned and accordingly K-indexed. This is the functional equivalent of 
adding Max-C/rT to Con, and the reason why in §5 I did not include Max-C/nr in the 
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support at the outset. Adding MAx-C/nr to the support results in (42). From (42), MAx- 
C/rT is installed and WLP (j) is removed, whereupon we return to inconsistency, in (51). 
As in 85.4, the process from that point results in the cloning of Pns and the installation 
of Prsy, then Max, Prs and Max-C, yielding the desired constraint ranking (43). 


(42) Prs | Max | Max-C/Rr | Max-C 
ahi. | W L 
d, g W L L 
j: L W W W 
k, Lo L W W 
mn L W 

(43) Prs | Max | Max-C 
a,h,i. | W L 
d, g. WwW L L 
k, l, o L W W 
m,n L W 


(44) CNTG > *CPLX >> ANC > Max-C/RT > Prsy > Max > Prs > Max-C 


To summarize, results from §5.3 suggested that, provided a learner is seeking a restric- 
tive grammar, the phonological approach to exceptionality may not suffer from overgen- 
eration. This contradicts recent arguments, which on examination appear to adopt the 
implausible assumption that a learner would be seeking a permissive grammar. That be- 
ing said, I have not yet clarified how the learner would arrive at the requisite ®-indices 
required by the phonological approach. That will be discussed in §7.3. Meanwhile, §5.4 
revealed that without further refinement, the BCD is prone to learning grammars that 
overgenerate even in a morphological approach to exceptionality, due to an overly atom- 
istic method of morphological generalization. This was remedied in §6 by the Morpholog- 
ical Coherence Principle (41), which solves the learning problem and simplifies CON jnit. 


7 The theoretical status of lexical indices 


In §7 I set Yidiny to one side and consider some matters of theory. 


7.1 Lexical M-indices 


Lexical M-indices are representations which are visible to the phonology, but they are 
not phonological elements per se. In OT, GEN cannot alter M-indices. It cannot add 
or remove them, or displace them from one morph to another. There is therefore no 
need for mechanisms such as M-index “faithfulness”, rather it is simply assumed that 
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the lexical affiliation of a morph m with an M-index M is identical in the input and 
output. This set of properties is shared with other kinds of lexical affiliation, such as 
the affiliation of a phonological element with its morph, and is termed Consistency of 
Exponence (McCarthy & Prince 1993b, Van Oostendorp 2007). 

Taking a historical view, M-indices closely resemble the RULE FEATURES and ALPHABET 
FEATURES of early generative phonology (GP) (Chomsky & Halle 1968, Lakoff 1970, Coats 
1970, Zonneveld 1978, inter alia). Both sets of formalisms fulfill the function of determin- 
ing for cases of exceptionality whether a morph m participates in certain phonological 
patterns or not, by ensuring that m is visible or not visible as required, to OT’s con- 
straints or GP’s phonological rules. Diacritic features were investigated extensively in 
GP. It was argued that the theory should not allow the phonology to manipulate diacritic 
features (Kiparsky 1973, Zonneveld 1978). The same applies to M-indices in OT. It was ar- 
gued that not all idiosyncrasies in the phonology can be analysed satisfactorily in terms 
of rule exception features, and that there is an additional role for cyclicity (Chomsky 
& Halle 1968, Kiparsky 1982b) and the same has been recognized for M-indices (Pater 
2009). In GP, it was also assumed that the diacritic features of morph m were distributed 
across, and directly characterized, each of the phonological elements (namely, segments) 
in m. We might ask whether this is also true of M-indices in OT. Suppose that it is, so 
that the M-indices of a morph m directly characterize each phonological element q that 
is lexically affiliated with m (that is all o which are EXPONENTS of m). In that case, the 
relative definition of an M-indexed constraint (25), repeated here as (45), can be revised 
and simplified as (46). 


(45) V(Cu) =aer V(C) N Lo(meM & Expo, m)), where: 
M is the set of morphs co-indexed to Cu 
Ex»(q, m) states that element q is an exponent of morph m 


(46) V(Cm) =aef V(C) N Lo(pedy), where: 
®y is the set of phonological elements co-indexed to Cy. 


It will be recalled that the relative definition of a constraint Cy is expressed as the set 
intersection between the loci of variation of the unindexed constraint C, written V(C), 
and the set of loci, Ly(D(@)) which contain some criterial type of phonological element 
9, described by predicate Dia), Importantly, this means that M-indexed constraints are 
defined DIRECTLY in terms of phonological elements, g, and only indirectly in terms of 
morphs m. The indirectness shows up in the complexity of D(q) in (45), which links 
morphs to their exponent o elements via the function Exp(g, m). This is in contrast with 
(46), where the assumption is that all o elements are directly characterized by the M- 
index borne by their affiliated morph. The constraint definition no longer refers to the 
morph itself, and so the predicate Dia) is simpler. 

At risk of laboring the point, the phonology itself assesses violations of M-indexed 
constraints directly in terms of 9 elements, not morphs. While it is possible to refer to the 
morphs in the definitions of M-indexed constraints as in (17/45), it is not necessary. Nor is 
it possible to refer only to the morphs and not to the g elements, since the loci of violation 
of these constraints are defined inherently at a sub-morphological, phonological level. 
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7.2 Lexical ®-indices 


Let us now consider the nature of lexical ®-indices of the type I invoked in 84.2 and §5.3. 
My proposal is that these are exactly like M-indices: they are non-phonological indices of 
lexical affiliation, visible to, but not manipulable by, the phonology and used for making 
particular phonological elements visible or not, as required, to OT’s constraints in order 
to provide a coherent account of exceptionality. The only distinction between ®-indices 
and M-indices lies in the supplementary assumption attached to M-indices, in (47). 


(47) The M-index assumption: 
A lexical index which characterizes phonological element o: will also 
characterize all other phonological elements o: affiliated with the same morph m. 


®-indices are not subject to this redundancy; they are affiliated with only those o ele- 
ments for which the affiliation makes any difference to the analysis of language. As I 
will show in 88, that makes ®-indices somewhat simpler to learn, since they correspond 
more directly to the evidence in the data. 

The reader may also have noticed that the definition of a ®-indexed constraint in 
(21) is almost exactly like the simplified definition of an M-indexed constraint in (46). 
This reflects the fact that for the operation of the phonology, it is o elements, and the 
indexation of specific o elements, that matter. Whether or not one chooses to adopt 
supplementary assumption (47) in fact has no material consequence for the evaluation of 
an individual indexed constraint. The question of whether there are other consequences, 
and whether they are desirable, is taken up in 89. 


7.3 Learning lexical ®-indices 


Given the proposal above, the learning of d-indices is quite parallel to the learning of 
M-indices. I assume that the MCP still applies, so that class-based exceptionality and 
K-indexed constraints continue to be learned with priority over idiosyncratic exception- 
ality, even though the latter will now be accounted for by ®-indexed constraints, not 
M-indexed. This is a coherent assumption to make. The MCP is concerned with the 
learning of class-based generalizations, whereas ®- and M-indexed constraints are alter- 
native devices for learning idiosyncrasies. Accordingly, in a stalled support once there 
are no K-indexed constraints available for cloning, the algorithm seeks a constraint C 
which, were it indexed to some set ® of phonological elements, would (i) favor at least 
one winner and (ii) favor no losers. All else proceeds as for M-indexed constraints. In 
the learning of Yidiny word-final deletion, the process begins as in $6, leading to a first 
inconsistency resolved by the addition of MAx-C/nT to Con, and proceeding from there 
to the second inconsistency (43), repeated here in part and in more detail as (48). 
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(48) Prs || Max | Max-C 

a. /margu-ni/ " L 
(mar gu:n) > (mar gu:) ni 

d.  /margu-ggu/ w L L 
(mar gun) > (mar gu:n) gu 

h. /gali-ga/ w L 
(ga li:ņ) > (ga li:) ga 

i  /gajara/ w L 
(ga ja:r) > (ga ja:) ra 

k. - 
/margu-nda/ L W w 
(mar gu:n) da > (mar gu:n) 

m. / palina L w 
(ga li:) na > (ga li:n) 

n. /gugara/ L w 
(gu ja:) ra > (gu ya:r) 

o. /maşinda-ņa-lna/ L w W 
(ma jin) (da ya:l) na > (ma jin) (da nal 


In (48), no K-indexed constraint is available for cloning. Turning to potential ®- 
indexed constraints, we see that the constraint PRs would, if it were co-indexed to all 
underlined phonological elements, favor at least one winner and favor no losers, and so 
it is cloned and co-indexed resulting in (49). 


(49) Prs | Prs-u | Max | Max-C 

a.  /margu-ni/ W W L 

d.  /margu-ggu/ W W L L 

h. /gali-ņa/ - w w L 

i /gagara/ W W L 

k.  /margu-nda/ L W W 
m. /gali-na/ L W 

n. /gujgara/ L W 

o. /majinda-ga-Ina/ L W W 


(50) CNTG > *CpLx >> ANC» Max-C/RT > Pns-u > Max > PRS 


16 Actually this is not strictly true. All past suffixes for example are undergoers, in which case the MCP would 
generate and rank Pnspsr. Notwithstanding this, the essential argument remains, since other morpholog- 
ical classes exist, such as ERGATIVE and “root”, that are not uniformly (non)undergoers, and still need to 
be handled by lexically-indexed, not K-indexed, constraints. This minor correction applies equally to the 
learning process in $6. 
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From there the algorithm proceeds in the now-familiar fashion, resulting in grammar 
(50). With its high-ranking MAx-C/Rr and Anc, (50) does not overgenerate. Moreover, 
given the argument in §7.2, that for Evar there is no detectable difference between M- 
indexed and ®-indexed constraints, we can see that grammar (50) is in all material as- 
pects identical to grammar (44) learned in §6. 


8 Constraint cloning 


8.1 Assessing eligibility for cloning 


It is necessary now to examine more precisely the processes by which constraints are 
deemed eligible for cloning (§8.1), by which a viable set of co-indexed elements is identi- 
fied (§8.2), and by which a selection is made between multiple eligible constraints (§8.3). 

Earlier, I introduced criteria by virtue of which a constraint becomes eligible for clon- 
ing. These are restated in (51) in a generalized from, so that the set S is: a coherent 
class of morphs for K-indexing; an idiosyncratic set of morphs for M-indexing; or an 
idiosyncratic set of lexical phonological elements for ®-indexing. 


(51) A constraint should be sought for cloning which, if it were indexed to set S, 
would (i) favor at least one winner, and (ii) favor no losers. 


Criterion (51ii) ensures that once the cloned constraint is added to the support, it can 
be installed; (51i) ensures that its installation will remove at least one WLP from the 
support, and thereby have some hope of freeing up other constraints. The formulation 
in (51) improves upon Pater's (2009: 144) criterion, which is to seek a constraint that 
favors no losers “for all instances" of some morph.” To see why Pater’s criterion fails, 
consider WLPs (h,l,o) from the stalled support (38), reproduced in part and in detail in 
(52). For the purposes of discussion, I assume we are attempting to learn an M-indexed 
constraint, though the argument generalizes to other kinds. 


(52) Prs | Max | Max-C 

h. /gali-ga/ ‘go-com[ImP]’ w 
(ga li:ņ) > (ga li:) na 

l  /wawa-lna/ ‘see-PURP’ 

L W W 

(wa wa:l) na > (wa wal) 

o. /majinda-ga-Ina/ ‘walk up-coM-PURP L Ww W 
(ma žin) (da na:l) na > (ma jin) (da ya:l) 


In (52), WLPs (h) and (o) both contain the suffix -na, a regular undergoer which our 
procedure ought to co-index to the M-indexed constraint PRsu. In WLP (h) word-final 
na is subject to deletion, and Pns favors the winner. In WLP (o) non-final na is parsed 


17 Pater's phrase “favors only winners" is equivalent to my “favors no losers". 
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into a foot and escapes deletion. Nevertheless, for WLP (o) Prs favors the loser. This 
has nothing to do with na, but is due to the non-deletion of the unparsed, word-final 
non-undergoer -/na. Pater’s co-indexing criterion asks whether Prs favors no losers “for 
all instances" of -ņa in the support. The answer is “no”, because (o) contains an instance 
of -ņa and Prs favors the loser for (0). This is the wrong result; the suffix -ņa ought to 
get co-indexed to Prsy. It comes about because Pater's criterion does not discriminate 
between morphs that contribute to violations and those which are present in the word, 
but do not contribute. The criteria in (51) avoid this problem because they refer directly 
to how the co-indexed constraint would perform, were it created. The next two sections 
detail how to operationalize them. 


8.2 Specifying co-indexed sets 


The question considered here is which set S ought to be co-indexed to a given constraint 
C if we wish to clone C? The answer varies depending on which kind of indexed con- 
straint we are constructing. One possible answer is that no such set exists, and C cannot 
be cloned. Seen from that angle, the question here is also: is C eligible for cloning? 

K-indexed constraints can be co-indexed only to the morphological classes K ;, K? ...K 
in the language ($6). In (41) I suggested that the preferred class for co-indexation is the 
MOST GENERAL one. Thus, to efficiently assess if constraint C is eligible for cloning and K- 
indexing, the learner should proceed stepwise through the available classes, ordered by 
decreasing generality. The process is one of trial and error. At each step, the constraint 
Cx is built and applied to all WLPs in the support. If Cx meets criteria (51) then it is 
successful; the process halts and Cx is used, otherwise the trial and error continues. If 
by the end, no successful constraint Cx; ...Cg, is found, then C is ineligible for cloning. 

For M-indexed and ®-indexed constraints, the desired set S can be identified by focus- 
ing attention on loci of violation. Suppose we are considering constraint C for cloning. 
For any WLP, p, its loci of violation of constraint C fall into three classes: the class w(p), 
responsible for violations of C that favor the winner (i.e., the locus occurs in the loser 
only), class L(p) which favor the loser (locus occurs in the winner only) and class N(p) 
which favor neither (occurs in both). Next define $45) as the set of phonological ele- 
ments ¢ contained in any of the loci in w(p), and 4; as the set of o elements contained 
in any of the loci in 1(p). Finally, define ®w as the union of Pw(p) for all WLPs, pi, p» ... 
Pn, in the support, and ®, as the union of all ®,(,) in the support. Now, consider the set 
(Oy — 1), the set difference between ®w and d. This is the set of all o elements which 
both (i) appear in at least one locus that in at least one WLP causes C to favor a winner, 
and (ii) never appear in a locus that causes C to favor a loser. For a ®-indexed constraint 
this is an optimal set S. If for a given constraint C, (Pw — c) is the null set, then we may 
conclude that C is ineligible for cloning." 


18 To be precise, if (w — d) is the null set then it is possible that there still exists some additional, viable 
set S which contains fortuitous elements o: which are elements of both dy and dn. such that in EVERY 
WLP p in which qj is contained in some number n of the loci w(p) there are at least n offsetting loci in 
L(p) which contain other elements qj which are also in S. Identifying these fortuitous elements qi, or even 
determining if any exist, would very likely be prohibitively expensive computationally. 
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To find the equivalent for an M-indexed constraint, it is necessary to extrapolate from 
®y and d; to morphs: set S will be the set (Mw — M) where Mw is the set of all morphs 
Mw, such that any of m,,’s phonological exponents is an element of Pw; and M; is the set 
of all morphs mij, such that any of mue phonological exponents is an element of dn. Note 
that Mw and Mr, can be calculated only after the calculation of Py and dy, is performed. 

In 87.1 I considered what is involved computationally in assessing violations of ®- 
and M-indexed constraints, and argued that the calculations for both are essentially con- 
cerned with o elements, not morphs. Here we see that the same is true when learning 
the co-indexed set. As in 87.1, one can bring morphs into the picture, to be sure, but in 
both cases doing so requires additional computational effort, for no effective difference 
in how the grammar will work. In 89 I will argue the theory to be preferred is one which 
admits lexically ®-indexed constraints, but not M-indexed. 


8.3 Selecting among eligible constraints 


Suppose there are multiple lexically-indexed constraints which are eligible for cloning; 
which do we choose? The principles of faithfulness delay and freeing-up of markedness 
constraints will eliminate some options ($5.1). Beyond that, I suggest the learner chooses 
the constraint which favors the most winners, and whose installment would therefore 
remove the greatest number of WLPs. A desirable consequence will be a bias toward 
restrictiveness. For example, suppose Max is eligible. If so, then so too is MAx-C, MAx- 
V, Max-p, etc. This *maximize-winners" criterion would select MAx, and increase the 
restrictiveness of the grammar, relative to the other options. 

Interestingly, Becker (2009) proposes a MINIMIZE-winners criterion, whose effect is 
to generate many, very specific cloned constraints, each indexed to highly specific sub- 
classes in the lexicon. The aim is to account for a particular phenomenon, which I de- 
scribe here. I argue that other accounts are possible, and that Becker's solution has 
undesirable consequences. 

When language learners assign novel words to existing grammatical categories, they 
do so on the basis of statistical correlations that exist in the lexicon, for example be- 
tween category membership and aspects of the members' phonological forms (Poplack, 
Pousada & Sankoff 1982; Albright 2002). One such task is to assign a word as excep- 
tional or non-exceptional, given evidence which underdetermines that choice. The key 
question here is, what existing statistical knowledge do speakers use, and what do they 
ignore? In Turkish, speakers appear to ignore correlations between the (non)alternation 
of a stop's laryngeal features and the quality of its neighboring vowel. It is proposed 
(Becker 2009; Becker, Ketrez & Nevins 2011) that this is because speakers do not access 
lexical statistics per se, rather they attend to the statistics of constraint indexation. Im- 
portantly, Con lacks constraints such as *[+HIGH]tV which refer to a stop and the qual- 
ity of its vocalic neighbor. Consequently, no such constraint can be indexed, making 
such correlations invisible and hence irrelevant to a speaker when she assigns a novel 
word to a (non)exceptional lexical category. Assuming this is the case, then in order for 
fine-grained knowledge to be available to speakers, an atomizing, “minimize-winners” 
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criterion for cloning is needed. However, this solution would seem neither necessary 
nor warranted. 

Notwithstanding the facts of Turkish, speakers in other languages and performing 
other novel-word tasks do use lexical correlations which lack a corresponding constraint 
in Con (Moreton & Amano 1999, Albright 2002, Albright & Hayes 2002, Ernestus & 
Baayen 2003), indicating that speakers are capable of such computation. In that case, 
atomized indexed constraints alone are not enough to produce the Turkish results. An 
additional stipulation is required, that this ability is suppressed when assigning novel 
words to exceptionality classes; yet this leads to a curious view of phonology. Whereas 
the grammar is usually the store of generalizations, just in the case of exceptionality, it is 
a store of highly detailed idiosyncrasy, and just in that case speakers ignore their usual, 
lexical store of idiosyncrasy and turn to the grammar. More satisfying would be to find 
some other explanation of the Turkish data. While that would take us well beyond this 
scope of this paper, it can be noted that what is required is a mechanism that can filter 
the lexical information in some way. That mechanism needn’t be part of the OT gram- 
mar. Indeed, if it is true that learners build certain constraints during learning (Flack 
2007b, Hayes & Wilson 2008, Hayes 2014), then there must exist EXTRA-grammatical 
generalization devices, which may provide the lexicon-filtering power needed. For now 
I conclude that that Becker's proposal follows from just one possible solution to an in- 
teresting puzzle, however both the puzzle and solution are outliers relative to what else 
we know. In contrast, a ^maximize-winners" criterion leads to the learning of restrictive 
grammars, and on those general grounds would appear correct. 


9 Discussion 


9.1 The case against concrete accounts 


Throughout this paper, I have considered only the ABSTRACT phonological approach to 
analyzing exceptionality, gradually building the argument that its superiority to the mor- 
phological approach lies in the fact that it localizes exceptionality to specific o elements, 
which are the elements in terms of which the relevant computation must be carried out. 
CONCRETE phonological approaches also localize exceptionality at a sub-morphological 
level, but compared to the abstract approach they are ill-suited to learning, and to seri- 
ality, as follows. 

Lexical indexation is an ideal response to BCD inconsistency, because it annotates 
the lexicon with indices which are invisible to all previously installed constraints. This 
guarantees, without needing to check, that all previously accounted-for WLPs remain 
accounted for. Even if some of them contain lexical 9 elements which acquire a new 
index, their violations of all previously ranked constraints remain unchanged, since no 
previously-ranked constraint is sensitive to the new index. In contrast, the alteration of 
phonological form — for example, removing a root node from certain segments — may 
very well alter the evaluation of WLPs by already-ranked constraints, thus it requires a 
re-evaluation of the entire ranking. It is not possible to simply repair an inconsistency 


90 


4 Phonological exceptionality is localized to phonological elements 


and resume the BCD process. An ABSTRACT phonological account is therefore easier to 
learn. 

In serial theories, concrete phonological approaches face the problem that in non- 
initial strata, it is possible that a preceding stratum will have removed, altered, moved or 
introduced, those aspects of phonological form which should function as pseudo-indices, 
which lack Consistency of Exponence. This opens up the possibility of all manner of 
phonological manipulations of exceptionality, for which I am unaware of any evidence. 

Taking a more historical view, Chomsky (1964) criticized concrete phonological ac- 
counts espoused by structuralists (e.g. Bloomfield 1939) for the proliferation of under- 
lying segments that they entailed. To the extent that such concerns matter to modern 
phonological theories, -indexation avoids such proliferation by augmenting represen- 
tations with non-phonological indices (cf §7), rather than additional underlying phono- 
logical distinctions. 


9.2 The case against M-indexing 


In §7 and §8 I showed that for both constraint evaluation and constraint learning, ex- 
ceptionality is calculated in terms of phonological elements, not morphs. Morphs can 
be brought into the picture, but at additional computational cost and to no effect. Per- 
haps, however, it is nevertheless empirically true that exceptionality is inherently morph- 
bound. If that were so, then phonological exceptionality in any morph m would always 
be either (i) uniform throughout all phonological exponents of m or (ii) entirely pre- 
dictably located within m. Yet this is not the case. If we accept something along the lines 
of Anderson's (1982) analysis of French schwa as an exceptionally-deleting /ø/ vowel, 
then that exceptional property is neither uniform throughout morphs nor does it have 
a predictable location. Similarly, in Turkish, non-high round vowels are phonotactically 
exceptional outside the first syllable (Clements & Sezer 1982; Hulst & Weyer 1991), yet the 
location of the exception is not predictable, as seen in a comparison of otoban ‘highway’, 
monoton ‘monotone’, fenomen ‘phenomenon’ and paradoks ‘paradox’. There is no doubt 
that in most known cases, exceptionality does happen to be either uniform or predictable 
within a morph, but this follows uninterestingly from the fact that most exceptional 
morphs are short, or that most phonological alternations are either local, in which case 
their location inside a morph is predictably restricted to an edge, or domain-spanning, 
in which case the morph acts uniformly. However, when such uninformative cases are 
set aside, the small, informative residue of evidence does not support the morph-based 
view. 

A second argument in defense of M-indices might be that morphs, and not ¢ elements, 
belong to lexical strata, and that a single morphological diacritic can therefore coherently 
index a whole set of phonological exceptionality patterns, patterns which impact differ- 
ent parts of the morph and which therefore would be only incoherently represented by 
individual diacritics on g elements. Yet the empirical falsity of this claim has long been 
recognized. SPE (Chomsky & Halle 1968) permitted both stratal diacritics, later labeled 
MORPHOLOGICAL FEATURES (Postal 1968) and more specific RULE FEATURES (Lakoff 1970), 
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in view of the fact that distinct phonological patterns associated with strata are not uni- 
formly attested in all morphs. For more recent work, see for example Labrune (2012: 
71,72,85ff) on Japanese. 

A third argument in defense of M-indices might be that since some kinds of phonologi- 
cal exceptionality are cyclic (§7.1), and since cycles are inherently tied to morphology, not 
9 elements, then something like M-indices are required anyhow, in which case ®-indices 
are redundant. I would suggest that this is a category mistake. While it is true that cycles 
are inherently tied to morphology, they are tied not to morphs, but to morphological 
operations. Some operations are non-concatenative and hence morph-free (Anderson 
1992). Cyclicity effects, therefore, are about how phonological subgrammars correlate 
with OPERATIONS; in contrast, ®-indices are about correlation with Forms. M-indices fall 
uncomfortably in between. Since they are inherently attached to morphs, they will be 
unavailable for the triggering of cyclicity effects associated with non-concatenative oper- 
ations. And, as we have seen above, they are inefficient, and in all likelihood insufficient, 
devices for exceptionality of forms. 


10 Conclusion 


For most of the generative period, an implicit assumption has been that we must choose 
between a concrete phonological and a diacritic morphological approach to phonological 
exceptionality.? But the argument from learning is that the correct theory is phonologi- 
cal and diacritic, based on lexical phonological indices which are visible to the phonology 
but not manipulable by it. The concrete phonological approach, whose pseudo-indices 
are manipulable by the phonology, is ill-suited to efficient learning (89.1). Diacritic ap- 
proaches are well suited to learning; however the computation of exceptionality is sim- 
ply not carried out in terms of morphs, rather its currency is lexical phonological ele- 
ments. This is true for both constraint evaluation (87.1) and the learning of co-indexation 
(88.2). Concurrently, plausible assumptions about learning ensure that a diacritic phono- 
logical account does not suffer from overgeneration (85.3), and reveal the need for a mor- 
phological analytic bias, operationalized here as the Morphological Coherence Principle 
(86). Finally, a morph-based diacritic theory appears empirically insufficient in the in- 
evitably small number of cases that are informative (89.2). No doubt there is much more 
to be said on the topic of exceptionality, but I hope to have established that the nature 
of exceptionality is, in essence, phonological and diacritic. 


Abbreviations 


Abbreviations conform with the Leipzig glossing rules; in addition are: LEsT ‘lest’ and 
SET 'inclusion/one of a group’ (Dixon 19772). 


P? Except, trivially, in purely abstract theories (e.g. Lamb 1966, Fudge 1967). 
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U-umlaut in Icelandic and Faroese: 
Survival and death 


Hoskuldur Thrainsson 


University of Iceland 


Although Icelandic and Faroese are closely related and very similar in many respects, their 
vowel systems are quite different (see e.g. Anderson 1969b; Arnason 2011). This paper com- 
pares u-umlaut alternations in Icelandic and Faroese and shows that the Faroese umlaut has 
a number of properties that are to be expected if the relevant alternations are morphological 
(or analogical) rather than being due to a synchronic phonological process. In Icelandic, on 
the other hand, u-umlaut has none of these properties and arguably behaves like a living 
phonological process. This is theoretically interesting because the quality of the vowels in- 
volved (both the umlaut trigger and the target) has changed from Old to Modern Icelandic. 
In addition, u-umlaut in Modern Icelandic is more opaque (in the sense of Kiparsky 1973) 
than its Old Icelandic counterpart, i.e. it has more surface exceptions. An epenthesis rule in- 
serting a (non-umlauting) /u/ into certain inflectional endings is the cause of many of these 
surface exceptions. Yet it seems that u-umlaut in Icelandic is still transparent enough to 
be acquired by children as a phonological process. In Faroese, on the other hand, u-umlaut 
became too opaque and died out as a phonological rule. It is argued that this has partly 
to do with certain changes in the Faroese vowel system and partly with the fact that the 
u-epenthesis rule was lost in Faroese. 


1 Introduction 


Anderson put the process of u-umlaut in Icelandic on the modern linguistic map with 
the analysis he proposed in his dissertation (Anderson 1969b) and several subsequent 
publications (Anderson 1969a; 1972; 1973; 1974; 1976). Because of changes in the vowel 
system from Old to Modern Icelandic, the nature of the umlaut process changed some- 
what through the ages (see e.g. Benediktsson 1959). The most important part of u-umlaut, 
and the only part that is alive in the modern language, involves /a/ ~ /ọ/ alternations in 
the old language (phonetically [a] ~ [o], as shown in 2), which show up as /a/ ~ /ó/ alter- 
nations in the modern language (phonetically [a] ~ [ce], cf. 2). This is illustrated in (1) 
with the relevant vowel symbols highlighted: 
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(1) Old Icelandic: Modern Icelandic: 
saga ‘saga’, OBL SQgu, PL sọgur saga, OBL sÓgu, PL sÓgur 
hvass ‘sharp’, DAT hvgssum hvass, DAT hvóssum 
tala ‘speak’, Let, tolum tala, 1PL tölum 


As these examples suggest, the quality of the root vowel /a/ changes when a /u/ follows 
in the next syllable. The relevant proecesses can be illustrated schematically as in (2). For 
the sake of simplicity I use conventional orthographic symbols to represent the vowels 
and only give IPA-symbols for the vowels that are important for the understanding of 
the umlaut processes. The umlaut-triggering vowels are encircled:! 


(2) a. u-umlaut in Old Icelandic and the system of short vowels: 


[-back] [+back] 
[-round] [+round] [round] [+round] 
[+high] i y (à [u] 
e o o 
[+low] e a [a] — o [9] 
b. u-umlaut in Modern Icelandic and the system of monophthongs:? 
[-back] [+back] 
[round]  [*round] [round] [+round] 
[+high] i ú [u] 
i lé 
[+low] e 6 [oe] ——- a [a] o [5] 


The gist of Anderson's analysis of u-umlaut can then be illustrated semi-formally as 
in the traditional generative phonological notation in (3), with the assimilating features 
highlighted (see also Rögnvaldsson 1981: 31, Thráinsson 2011: 89-90)? 


(3 a. u-umlaut in Old Icelandic: 
(ai > |«round]| / _CoV +round 
+high 
+back 


b. u-umlaut in Modern Icelandic: 


+round 
fal > | back | / Co [round 


-back 


-low 


1 Note that in the representation of the Modern Icelandic vowel system, the accents over vowel symbols 
have nothing to do with quantity but simply denote separate vowel qualities. Thus /i/ is [i], /i/ is [1], /à/ is 
[u] and /u/ is [v], as the schematic representation in (2) suggests. 

?Iam assuming here, like Thráinsson (1994) and Gíslason & Thráinsson (2000: 34), for instance, that Mod- 
ern Icelandic only distinguishes between three three vowel heights and that /e/ [e] and /ó/ [ce] are both 
phonologically [+low], like /a/ [a] and /o/ [9]. For different assumptions see e.g. Árnason (2011: 60). 

3 Here, and elsewhere in this paper, I will use the kinds of formulations of rules and conditions familiar 
from classical generative phonology since much of the work on u-umlaut has been done in that kind of 
framework. For analyses employing more recent frameworks see Gibson & Ringen 2000, Hansson 2013 


and Ingason 2016. Most of the argumentation in this paper should be relatively framework-independent, 
however. 
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As the illustration in (3) shows, the modern version of the umlaut is somewhat more 
complex than the old one, assimilating two features rather than one. Nevertheless, it is 
still arguably a phonologically (or phonetically) natural assimilation process, assimilat- 
ing rounding and backness. 

Although the u-umlaut discussion was most lively on the international scene in the 
1970s (see e.g. Iverson 1978; Iverson & Anderson 1976; Oresnik 1975; 1977, cf. also Valfells 
1967), the topic keeps popping up to this day, e.g. in journals and conferences dedicated 
to Scandinavian linguistics (see e.g. Gibson & Ringen 2000; Indridason 2010; Thrains- 
son 2011; Hansson 2013) and even in recent master’s theses and doctoral dissertations 
(see Markússon 2012; Ingason 2016). The main reason is that while u-umlaut in Modern 
Icelandic is obviously very productive, being applied consistently to new words and loan- 
words, it shows a number of intriguing surface exceptions. These have been discussed 
extensively in the literature cited but here I will concentrate on the most common and 
widespread one, namely the lack of umlaut before a /u/ that has been inserted between 
the inflectional ending /r/ and a preceding consonant. This epenthesis did not exist in 
Old Icelandic as illustrated in (4): 


(4) Old Icelandic: Modern Icelandic: 
dalr ‘valley’, latr ‘lazy’ dalur, latur 


If u-umlaut is a phonological rule in the modern language, this u-epenthesis has to fol- 
low it, as it did historically. This is one of the properties of u-umlaut that have been used 
to argue for the necessity of relatively abstract phonological representations and deriva- 
tions (e.g. Anderson 1969b; 1974; Rógnvaldsson 1981; Thráinsson 2011; Hansson 2013) 
while others have maintained that u-umlaut is not a phonological process anymore in 
Modern Icelandic and the relevant alternations are morphologized and purely analogi- 
cal (see e.g. Markússon 2012) or at least “morpheme-specific”, i.e. triggered by particular 
morphemes that may or may not contain a /u/ (Ingason 2016).* 

In this paper I will compare u-umlaut alternations in Modern Icelandic and Modern 
Faroese. This comparison will show very clearly that u-umlaut in Modern Faroese has a 


4 Ingason (2016: 220) formulates his umlaut rule as follows: 
Realize an underlying (ai as /ó/ in the syllable which precedes the morpheme which triggers the umlaut. 


As can be seen here, no mention is made of a triggering /u/ in the rule. The reason is that Ingason wants 
to derive all all paradigmatic /a/ ~ /ó/ alternations the same way, including the ones where /u/ has been 
syncopated historically. Thus he argues that the Mou. SG. morpheme -ø in feminine nouns like sök ‘guilt, 
case’ and the NoM./AccC.PL. morpheme -ø in neuter nouns like börn ‘children’ triggers umlaut the same way 
that the DAT.PL. morpheme -um does in sökum and börnum. But many researchers have wanted to distin- 
guish between morphologically conditioned umlaut, where there is no triggering /u/, and phonologically 
conditioned umlaut triggered by /u/, e.g. Rógnvaldsson (1981). One reason for doing so comes from the 
behavior of loanwords like the adjective smart ‘smart, chic’. Here the NoM.sc.r and the NOM/ACC.PL.N can 
either be smart or smórt, i.e. a morphologically conditioned umlaut may or may not apply. But once an 
umlauting inflectional ending containing /u/ is added to the loanword smart, the u-umlaut becomes obliga- 
tory. Thus DAT.PL can only be smórt-um and not *smart-um and the NoM.Pr.wx form has to be smórt-u and 
not *smart-u. This suggests that the morphologically conditioned umlaut is more prone to exceptions than 
the phonologically conditioned one, which is actually to be expected. Thanks to Eirikur Rógnvaldsson for 
pointing this out to me. 
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number of properties (e.g. paradigm levelling, various kinds of exceptions, total absence 
from certain paradigms, inapplicability to loanwords ...) that are to be expected if the 
relevant alternations are no longer due to a synchronic process. In Modern Icelandic, on 
the other hand, u-umlaut has none of these properties and behaves more like a phono- 
logical rule. This is of general theoretical interest since it illustrates how phonological 
rules can survive (in the case of Icelandic) despite reduced transparency (in the sense 
of Kiparsky 1973) and how changes in the phonological system can cause the death of a 
phonological rule (in the case of Faroese) and what the consequences can be. 

The remainder of the paper is organized as follows: In §2 I first illustrate how the 
u-epenthesis works in Modern Icelandic and then present a couple of arguments for the 
phonological (as opposed to morphological) nature of Modern Icelandic u-umlaut. §3 
first describes some facts about the Faroese vowel system that must have been impor- 
tant for the development of u-umlaut and then shows that u-epenthesis does not exist 
anymore as a phonological process in Modern Faroese. It is then argued that these de- 
velopments led to the death of u-umlaut as a phonological process in Faroese. §4 then 
contains a systematic comparison of u-umlaut alternations in Modern Icelandic and Fa- 
roese, concluding that the Faroese ones must be analogical (and morphological) in nature 
as they do not exhibit any of the crucial phonological properties that Modern Icelandic 
u-umlaut alternations show. In Icelandic, on the other hand, u-umlaut does not show the 
non-phonological properties listed for its Faroese counterpart. §5 concludes the paper. 


2 u-epenthesis and u-umlaut in Modern Icelandic 


2.1 The epenthesis rule 


The phoneme /r/ frequently occurs in Old Icelandic (Old Norse) as a marker of various 
morphological categories, including Nom.sc of strong masculine nouns and adjectives 
as illustrated in (5). It sometimes assimilated to a preceding consonant, e.g. /s, l, n/ (cf. 
5c)? but it was deleted after certain consonant clusters, such as / gl, gn, ss/ (cf. 5d): 


(5) a. stór-r ‘big’, mó-r ‘peat’, há-r ‘high’ 
b. dal-r ‘valley’, lat-r ‘lazy’, tóm-r ‘empty’, hard-r ‘hard’ 
c. is-s ‘ice’, laus-s ‘loose’, stól-I ‘chair’, fin-n ‘fine’ 


d. fugl ‘bird’, vagn ‘wagon’, foss ‘waterfall’ (stem foss-) 


It is likely that the /r/ in words of type (5b) was syllabic in Old Icelandic. There are 
no syllabic consonants in Modern Icelandic, on the other hand. Instead a /u/ appears 
between the /r/ and the preceding consonant in the modern version of words of type 
(5b). There is historical evidence for this u-insertion from the late thirteenth century and 


5 Assimilation to stem-final /l, n/ only happened in Old Icelandic if these consonants were preceded by 
long vowels, i.e. Old Icelandic diphthongs and vowels that are standardly represented by accented vowel 
symbols in Old Icelandic orthography, cf. stól-1 ‘chair’ vs. dal-r ‘valley’, fin-n ‘fine’ vs. lin-r ‘soft, limp’, 
heil-I ‘whole’ vs. hol-r ‘hollow’. 
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onwards (see e.g. Kristinsson 1992 and references cited there) and many linguists have 
argued that u-epenthesis is still a productive phonological process in Modern Icelandic 
(e.g. Anderson 1969b,a; Orešnik 1972; Rögnvaldsson 1981; Kiparsky 1984).° This implies 
that speakers distinguish between a -ur-ending where the underlying morpheme is #-r# 
and the /u/ is epenthetic (and does not trigger u-umlaut) and a -ur-ending where the /u/ 
is not epenthetic and the underlying morpheme is #-ur# (and the /u/ triggers u-umlaut). 
This contrast is illustrated in (6a) vs. (6b) (see also the examples in 1 and 4 above): 


(69 a. #dal+r# ‘valley’ Nom.sc.m — dal-ur by epenthesis no umlaut 
#lat+r# ‘lazy’ NOM.SG.M — lat-ur by epenthesis no umlaut 

b. #sag+ur# ‘sagas’ ^ NOM.PLF — sÓg-ur u-umlaut 
#tal+ur# ‘numbers’ NoM.PLF — töl-ur u-umlaut 


Thus the Nom.sc ending #-r#, which is both found in strong masculine nouns like dalur 
‘valley’ and in the strong masculine form of adjectives like latur ‘lazy’, does not have the 
same properties as the NoM.PL ending #-ur# which is found in feminine nouns like sógur 
‘sagas’ and tólur ‘numbers’. Despite their surface similarities in certain environments, 
speakers can clearly distinguish these endings. A part of the reason must be that the 
NOM.SG.M ending #-r# only shows up as -ur in phonologically definable environments, 
i.e. the modern version of words with stems of type (5b), whereas the NoM.PL.F ending #- 
ur# is not so restricted and always shows up as -ur. This is illustrated in Table 1 (compare 
the examples in 5). 

Comparison of Table 1 and the Old Icelandic examples in (5) reveals a slight extension 
of r-deletion: The /r/ of the morphological ending #-r# is now deleted after /r/ (compare 
line d of the table to 5a) and after all instances of /s/, not just /ss/ (compare line d of the 
table to (5c,d)). The u-epenthesis illustrated in line b of Table 1 is an innovation, of course. 
Otherwise the NoM.sc.M ending behaves in much the same way as in Old Icelandic. The 
different behavior of the morphemes compared in Table 1 can be seen as an argument 
for distinguishing them in the underlying form, e.g. for not analyzing the N.sc.m ending 
as #-ur#. 


2.2 Some phonological properties of Modern Icelandic u-umlaut 


In this section I will mention two sets of facts which show that u-umlaut still has certain 
properties in Modern Icelandic that are to be expected if it is a phonologically condi- 
tioned process. 


6 Orešnik later maintained that u-epenthesis could not be a synchronic rule in Modern Icelandic because 
of the existence of exceptional word forms like klifr ‘climbing’ (from the verb Klifra ‘climb’), sótr ‘slurp- 
ing’ (from the verb sötra ‘slurp’), pukr 'secretiveness' from the verb pukra(st) ‘be secretive about’, etc. 
(Ore&nik 1978; see also the discussion in Kjartansson 1984). In words of this kind one would have expected 
u-epenthesis to apply. The importance of these exceptions is not very clear since this is a very special class 
of words (all derived from verbs ending in -ra) and it is typically possible or even preferred to apply the 
epenthesis rule to these forms, giving Klifur, sótur, pukur, etc. For the sake of completeness it should be 
noted that the final -r in word forms like sótr, pukr has to be voiceless and this may be related to the fact 
that there are no syllabic consonants in Modern Icelandic, as stated above. 
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Table 1: Phonological realization of the inflectional endings #-r# and #-ur# in 


Modern Icelandic. 


type of stem 


phonological realization of 
the Nom.sG.M ending #-r# 


phonological realization of 
the NoM.PL.F ending #-ur# 


ending in a vowel 


ending in a sin- 
gle consonant 
(but see c) 
ending in a high 


vowel + /Lln/ 


ending in /s, r/ or 
consonant 
clusters ending 
in /1, n/ such as 


-r 

(mó-r ‘peat’, há-r ‘high’) 
-ur 

(dal-ur ‘valley’, lat-ur lazy’) 


assimilation 


(stól-I ‘chair’, fin-n ‘fine’) 


deletion 
(is ‘ice’, laus ‘loose’, foss 
‘waterfall’ bjór ‘beer’, stór 


‘big’, fugl ‘bird’, vagn 


-ur 

(ló-ur ‘golden plovers’) 
-ur 

(sóg-ur ‘sagas’, tol-ur 
‘numbers’) 

-ur 

(spus-ur ‘wives’, sül-ur 
‘columns’, dyn-ur 
‘mattresses’) 

-ur 

(ys-ur ‘haddocks’, aus-ur 
scoops’, hér-ur ‘whores’, 
ugl-ur ‘owls’, hrygn-ur 
spawning fish', byss-ur 


/gl, gn/ i 
wagon’) 


‘guns’) 


First, if u-umlaut was morphologically conditioned and not phonologically, we would 
expect it to be restricted to certain morphological categories or parts of speech. It is 
not. It applies in the paradigms of nouns, adjectives and verbs when a /u/ follows in the 
inflectional ending (with the exception of the epenthetic /u/ already mentioned). This is 
illustrated in (7): 


(7 a. saga ‘saga’, OBL.SG s6g-u, NOM/ACC.PL s6g-ur, DAT.PL sÓg-um 
b. snjall ‘smart’, DAT.sG.M snjóll-um, NOM.PL.WK snjóll-u 


c. kalla ‘call’, 1.pu.prs kéll-um, 3.p1.Psr kólluó-u 


The so-called i-umlaut is very different in this respect. It is clearly not alive as a phonolog- 
ical rule anymore but its effects can still be observed in the modern language in certain 
morphologically definable environments. As a result we can find near-minimal pairs of 
word forms where i-umlaut has applied in one member but not the other although the 
phonological conditions seem identical. Some examples are given in (8): 


(8) a. háttur ‘mode’, pat.sc haett-i/* hátt-i, NoM.PL haett-ir/* hátt-ir 


b. sáttur ‘satisfied’, NOM.SG.M.WK "szett-i/sátt-i, NOM.PL.M "saett-ir/sátt-ir 


In (8a) we see examples of the paradigmatic alternation /á ~ æ/ (phonetically [au] ~ [ai] in 
the modern language, probably [a:] ~ [e:] in Old Icelandic) originally caused by i-umlaut. 
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In the Nom.sc we have /á/ in the stem but in the pat.sc the only acceptable form is hætti 
and the “non-umlauted” version "hátti is unacceptable. Similarly, in the Nom.PL only 
haettir is acceptable and "háttir is not. At a first glance we might think that an /i/ in 
the inflectional ending is still causing this “umlaut” but a comparison with the adjecti- 
val forms in (8b) indicates that this cannot be the case. Here the only acceptable weak 
NOM.SG.M form is sátti and not “sætti and the only NoM.PL.M form is sáttir and not " saettir. 
Sothe i-umlaut alternations in Modern Icelandic are clearly morphologically conditioned 
and not phonological anymore (see also Thráinsson 2011: 93 for further examples of this 
kind). 

Second, recall that standard generative phonology formulations of u-umlaut in Ice- 
landic of the kind illustrated in (3b) above state explicitly that /u/ only triggers umlaut 
of /a/ in the immediately preceding syllable. This is illustrated by examples like the fol- 
lowing: 


(9) a. bakki ‘bank’ par.Pr bókk-um/"bakk-um 


b. akkeri ‘anchor’ DAT.PL *ókker-um/akker-um 


In (9a) the u-umlaut obligatorily applies to the root vowel /a/ in the immediately preced- 
ing syllable. In (9b), on the other hand, the /u/ in the (same) inflectional ending cannot 
apply to the root vowel /a/ because there is a syllable intervening. An interesting and 
much discussed case, e.g. by Anderson in several of the publications cited above, in- 
volves trisyllabic words with two instances of /a/ in the stem. Consider the examples in 
(10): 


(10) a. kalla call 
1.sc.pst kalla-ó-i, 1.pu.pst *kalló-ó-um/kóllu-ó-um/"kallu-Ó- um/*kélla-d-um 


b. banan-i ‘banana’ 
DAT.PL banén-um/bénun-um/*banun-um/*bénan-um 


Consider first the conceivable 1.Pr.PsT forms of the verb kalla ‘call’. Based on the formu- 
lation (3b) of the u-umlaut rule, one might have expected the form *kallédum, where the 
/u/ in the inflectional ending triggers u-umlaut of the /a/ in the preceding syllable. This is 
not an acceptable form, however. The reason is that in forms of this sort a “weakening” 
of unstressed /ó/ to /u/ is obligatory. This weakening is found in in many words, e.g. the 
plural of the word héraó ‘district’, plural héróó or (preferred) hérud, medal ‘medicine’, 
plural meóól or (preferred) medul. It is not always obligatory but it seems that in the 
past tense of verbs of this sort it is. But once the (umlauted) /ó/ in "kallóóum has been 
weakened to /u/ it obligatorily triggers u-umlaut of the preceding /a/ so kólluóum is ac- 
ceptable but *kalluóum is not. Finally, the form *kóllaóum is not acceptable either, since 
there u-umlaut would be applied across an intervening syllable, which is not possible, 
as we have seen (cf. 9b). The u-umlaut works in a similar fashion in the word banani, 
except that here the weakening of the second (and unstressed) syllable from /6/ to /u/ is 
not obligatory. Hence banónum is an acceptable form, with the /u/ in the final syllable 
triggering u-umlaut of the preceding /a/ to /ó/. But if this /ö/ is further weakened to 
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/u/, then u-umlaut of the first /a/ is obligatory and bönunum is an acceptable form but 
*banunum is not.” As predicted by the formulation of the u-umlaut rule in (3b) a form 
like *bönanum is unacceptable because there the u-umlaut would have applied across an 
intervening syllable. Facts of this sort have been interpreted as showing that u-umlaut 
in Modern Icelandic is of a phonological nature since it depends on syllabic structure 
(no syllables can intervene between the umlaut trigger and the target) and it can be ap- 
plied iteratively (a /u/ which itself is derived by u-umlaut and subsequent independently 
needed weakening can trigger u-umlaut). 


3 The conditions for u-umlaut in Modern Faroese 


3.1 u-umlaut and the Modern Faroese vowel system 


Modern Faroese has preserved some u-umlaut-like vowel alternations. A couple of ex- 
amples are given in (11) (see also Thráinsson et al. 2012: 78, 100, passim): 


(11) dag-ur ‘day’, DAT.PL deg-um; spak-ur ‘calm’, NOM.PL.WK spok-u 


At first glance, these alternations seem very similar to the Icelandic ones described in 
the preceding sections. But while the u-umlaut alternations are arguably phonologically 
(or phonetically) natural in Modern Icelandic (see the diagram in 2b and the formulation 
in 3b), it will be claimed below that this is not the case in Faroese. To demonstrate this, 
it is necessary to look closely at the Faroese vowel system. Consider first the follow- 
ing schematic representation of Faroese u-umlaut of the type just illustrated, where the 
alleged umlaut trigger is encircled (cf. Thráinsson 2011: 98, Thráinsson et al. 2012: 33, 
compare Árnason 2011: 248-250): 


(12) u-umlaut in Modern Faroese and the system of monophthongs: 


[-back] [+back] 
[-round] [+round] [-round] [+round] 
[+high] i y (à) [u:/o] 
e > 9 [e:/ce] o 
[+low] ae [ea:/a] a 2 


Something like (13) would seem to be a possible formulation of a process of this kind 
in traditional generative phonology terms (compare 3b): 


7 It is sometimes claimed that bónónum is also an acceptable form for some speakers. If this is so, it is possible 
that the /ö/ in the next-to-last syllable triggers u-umlaut (i.e. ó-umlaut!) of the /a/ in the first syllable. 
That would simply mean that the feature [-low] in the definition of the environment of the u-umlaut in 
(3b) would be omitted. But since there are no derivational (nor inflectional) morphemes containing an 
underlying /6/ (i.e. an /ó/ that cannot have been derived by u-umlaut), this proposal cannot be tested 
independently of the iterative rule application, as pointed out by a reviewer. 

8 Vowel length is predictable in Faroese, as it is in Icelandic: Vowels are long in stressed open syllables, 
otherwise short. As illustrated in the brackets in (12), there is often a considerable difference in the phonetic 
realization of the long and short variants. This will be illustrated below. — It should be noted that Árnason 
(2011: 76) assumes a different analysis of Faroese monophthongs. 
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(13) Possible phonological formulation of u-umlaut in Modern Faroese: 


ase oe / CVV [ «round 


+back 


-low 


Presented this way, u-umlaut in Faroese looks like a fairly natural assimilation rule at 
a first glance.’ But the facts are somewhat more complicated. 

First, the alleged trigger /u/ is not too stable in Modern Faroese. The reason is that 
unstressed /i,u/ are not distinguished in all Faroese dialects. In some dialects they merge 
into an [1]-like sound, in others into an [o]-like sound but some dialects distinguish them 
as [1] and [o] (see Thráinsson et al. 2012: 27, and references cited there). This situation has 
clearly added to the phonological opacity of u-umlaut alternations for speakers acquiring 
Faroese. 

Second, the target of the u-umlaut in Faroese is arguably a “moving” one. As indicated 
in (12), the umlaut affects the phoneme represented there as /æ/. As the orthography 
suggests, it is a descendant of Old Norse /a/ in words like dagur, spakur (see 11). It is 
realized phonetically as [ea:] when long and [a] when short, as shown in (12), cf. spakur 
[spea:"ko1] ‘calm’, sc.n spakt [spakt] (see Thráinsson et al. 2012: 34 passim). But in the 
history of Faroese Old Norse /a/ [a] and /æ/ [e:] merged so the phoneme represented here 
as /ze/ can also be a descendant of Old Norse /ze/ and then it is represented in the spelling 
as ‘x’, cf. trælur [t'1ea:o1] ‘slave’, æða [ea:va] ‘eider duck’. Words written with o" show 
the same alternation between long [ea:] and short [a] as demonstrated for spakur and 
spakt above (e.g. vænur [vea:no1] ‘beautiful’ sc.M vs. vant [vant], cf. Thráinsson et al. 
2012, p. 34). Yet it seems that u-umlaut is rarely if ever found in the ‘e’-words. Thus 
the pat pt, of trælur is trælum and not "trelum (compare Dat St. dølum of dalur ‘valley’) 
and although the words æða 'eider duck’ and ada ‘(big) mussel’ sound the same, i.e. as 
[ea:va], the par pt, of the former has to be æðum [ea:von] and evum [e:von] can only be 
DAT.PL of aða. 

To further complicate matters, the development of Old Norse /a/ in Faroese has left 
"room" for a “regular /a/" in the Faroese vowel system, as shown in the diagram in (12). 
It occurs in loanwords and is realized as [a:] when long and [a] when short, cf. Japan 
[japan], japanskur [japh^ansko1] ‘Japanese’." It does not seem that this vowel ever 
undergoes u-umlaut in Faroese (for further discussion see 84). 


? A reviewer suggests, however, that a process changing rounding and height as formulated for Faroese in 
(13), might be less natural from the point of view of acoustic phonetics than a process changing rounding 
and backness the way the u-umlaut rule in Modern Icelandic does according to (3): The former affects 
both F1 (for the height difference) and F2 (for rounding) whereas the latter affects F2 in opposite directions 
(raising it for fronting but lowering it for rounding). Thus the Modern Icelandic u-umlaut rule would 
"generate more similar input-output mappings", which may be preferred to less similar ones. 

10 A reviewer points out that the fact that u-umlaut does not apply do "oe" words in Faroese suggests that 
“u-umlaut had already taken on a morphological character before /a/ and /z/ merged? But since there are 
no written records of Faroese from 1400-1800, the historical development of the language is very murky. 

H Tn the noun Japan the stress falls on the first syllable, in the adjective japanskur it falls on the second one 
as indicated. Hence the quantity alternation in the first vowel. 
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Finally, there is no u-epenthesis in Modern Faroese to “explain away” apparent excep- 
tions to u-umlaut as will be shown in the next section. 


3.2 The lack of u-epenthesis in Modern Faroese 


Now recall that the most obvious surface exception to u-umlaut in Modern Icelandic is 
due to the u-epenthesis described above. This rule creates -ur-endings that do not trigger 
u-umlaut. It was argued that this epenthesis rule is still productive in Icelandic, witness 
the fact that it only applies in phonologically definable environments. Hence there is 
a clear distributional difference between -ur-endings produced by the epenthesis rule 
(and not triggering u-umlaut) and -ur-endings where the /u/ is a part of the underlying 
form (and triggers umlaut). This is not the case in Faroese, where the ending -ur as a 
marker of the Nom.sc of strong masculine nouns and adjectives, with a /u/ that was his- 
torically inserted by epenthesis, has been generalized to all environments. Hence it has 
become distributionally indistinguishable from other -ur-endings. Table 2 compares the 
phonological realization of the NoM.sG.M #-r#-ending in Modern Icelandic to its Modern 
Faroese counterpart (see also Thráinsson 2011: 100): 


Table 2: Phonological realization of a strong NoM.sG.M-ending in Modern Ice- 
landic and Modern Faroese. 


type of stem phonological realization ofa phonological realization of a 
strong NOM.SG.M ending in strong NOM.SG.M ending in 
Modern Icelandic Modern Faroese 
a. endinginavowel -r -ur 
(mó-r ‘peat’, há-r ‘high’) (mó-ur/mógv-ur ‘peat’, há-ur 
‘high’) 
b. ending in a sin- -ur -ur 
gle consonant (dal-ur ‘valley’, lat-ur ‘lazy’) (dal-ur ‘valley’, lat-ur ‘lazy’) 
(but see c) 
c. ending ina high er -ur 
vowel + /ln/ assimilation (stól-ur ‘chair’, fin-ur ‘fine’) 
(stól-I ‘chair’, fin-n ‘fine’) 
d. ending in /s, r/ or . -ur 
consonant deletion f (is-ur ‘ice’, leys-ur ‘loose’, 
clusters like /gl, (is ice’, laus loose’, foss foss-ur ‘waterfall’, stór-ur 
gn/ waterfall’, stór ‘big’, fugl ‘big’, fugl-ur ‘bird’, vagn-ur 


‘bird’, vagn ‘wagon’) ‘wagon’) 


This has clearly made the u-umlaut rule in Faroese more opaque since now the non- 
umlauting and umlauting ur-endings occur in the same phonological environments. It 
seems very likely that this has contributed to the death of u-umlaut as a phonological 
process in Faroese. 
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4 Testing the predictions 


In the preceding discussions I have described /a ~ 6/ alternations in Modern Icelandic 
and their Modern Faroese counterparts. I have argued that the Icelandic alternations are 
still governed by a synchronic phonological process. Although these alternations are still 
found in Modern Faroese, I have argued that they cannot be governed by a phonological 
rule. Instead they must be morphologically governed or analogical. This analysis makes 
several testable predictions (see Thrainsson 2011: 100-102). 

First, we do not a priori expect phonologically conditioned alternations to be restricted 
to particular morphological categories whereas morphologically conditioned alterna- 
tions obviously are, by definition. As we have already seen, the Icelandic u-umlaut occurs 
in the inflectional paradigms of nouns, adjectives and verbs and in various grammatical 
categories (cases, numbers, tenses, persons ...). Its Faroese counterpart behaves differ- 
ently. It is found in the inflectional paradigms of nouns and adjectives, as we have seen 
(cf. 11), but not in the past tense forms of verbs, where it would be expected on phono- 
logical grounds. Thus we have vid kólluóum in Icelandic vs. vit kallaóu in Faroese for 
1.PL.PST ‘we called’, and vió frümdum vs. vit framdu in Faroese for 1.Pr.PsT ‘we did, made’. 

Second, a phonological rule should not allow analogical extensions to forms that do 
not fit its structural conditions. Such extensions are not found for Icelandic u-umlaut 
but in Faroese they are very common. Thus the /o/ of the oblique cases segu ‘saga’ has 
been analogically extended to the Nom.sc form sega and many other words of a similar 
type. The corresponding form *sóga is unacceptable in Icelandic.” 

Third, a phonologically conditioned rule should apply whenever its structural condi- 
tions are met. Thus we would not expect to find inflectional forms in Icelandic where 
u-umlaut fails to apply in an appropriate environment. Such examples are very common 
in Faroese, on the other hand. Thus the parpt of the noun rakstur ‘shave’ in Faroese 
is rakstrum and not the expected "rokstrum, the Dat pt. of spakur ‘calm’ can either be 
spekum or spakum, etc. (see Thráinsson et al. 2012: 79, 100, passim). Corresponding 
unumlauted forms are unacceptable in Icelandic. 

Fourth, there is evidence for "iterative" application of u-umlaut in Icelandic, with one 
application of the u-umlaut rule feeding another. This was discussed above (second 
part of $2.2) in connection with forms like 1.Pr.Psr kölluðum '(we) called’ and DAT.PL 
bónunum ‘bananas’. No such evidence is found in Faroese, where the corresponding 
forms are kallaóum and bananum.” 


12 As a reviewer reminds me, the Icelandic neologism for computer is interesting in this connection. It was 
supposed to be tólva (related to the word tala ‘number’ — this was when computers were mainly used for 
computing) in NOM.scG, oblique singular cases tölvu. In Proto-Nordic time /v/ could trigger umlaut of /a/ 
to /o/ so we have Old Norse words like volva ‘sooth-sayer, witch’. But since /v/ is not a trigger of umlaut 
in Modern Icelandic (witness loanwords like salvi ‘salve, cream’), speakers tend to use the form talva for 
NOM.SG, thus in a way undoing the underlying /ó/ in the nominative as if they are “assuming” that the /ó/ 
in the oblique cases is derived by a synchronic u-umlaut from /a/, as in words like saga ‘saga’, oblique sógu 
(for some discussion see Thráinsson 1982). 

35 The latter form may be related to the fact that banan ‘banana’ is a loanword and contains the vowel /a/ 
(long variant [a:]) and not /æ/, cf. the discussion in §3.1. See also the next paragraph. 
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Finally, Icelandic u-umlaut is so productive that it is naturally applied in loanwords, 
as we have seen. This is not so in Faroese. Thus the word app (for a small program) has 
been adopted into both languages. In Icelandic the pAT.Pr has to be öppum whereas the 
natural form is appum in Faroese. This can easily be verified by searching for the word 
combinations med óppum and vió appum ‘with apps’ on Google. For the first variant one 
finds a number of Icelandic hits, for the second Faroese ones. 

The general conclusion, then, is that u-umlaut in Modern Icelandic has a number of 
properties that are to be expected if it is a phonological process but none of the properties 
one might expect of morphologically conditioned or analogical alternations. 


5 Concluding remarks 


While it has often been argued that phonology need not be “natural” (see e.g. Anderson 
1981), there must obviously be limits to the “unnaturalness” and opacity of phonological 
processes. Once they become too unnatural and opaque, they can no longer be acquired 
as such and the phonological alternations originally created by them will be relegated to 
morphology. Then their productivity will be limited and it will at best survive to some 
extent by analogy, but analogical processes are known to be irregular and unpredictable. 
The fate of i-umlaut in Icelandic is a case in point, as described above (see the discussion 
of the examples in 8). But whereas we do not have detailed information about how i- 
umlaut died as a phonological process, comparison of the development of u-umlaut in 
Icelandic and Faroese sheds an interesting light on how a phonological rule can die and 
how it can survive despite changing conditions. 
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Part II 


Morphology 


Chapter 6 


Word-based Items-and-processes 
(WoBIP): Evidence from Hebrew 
morphology 


Outi Bat-El 


Tel-Aviv University 


In his seminal book A-Morphous Morphology, Anderson provides ample evidence support- 
ing the item-and-process approach to morphology, whereby relations between words, and 
thus the derivation of one word from another is expressed in terms of processes. Although 
Anderson excluded Semitic languages from the paradigm, I argue in this paper for the ad- 
vantage of item-and-process in the analysis of Modern Hebrew word relations. Under this 
approach, the word/stem is the base, and the putative consonant root is just a residue of 
phonological elements, which are lexically prominent as are consonants in non-Semitic lan- 
guages. The empirical basis of the arguments is drawn from natural and experimental data 
of adult Hebrew as well as child Hebrew. 


1 Introduction 


"Items vs. processes in morphology" is the title of §3.4 in Anderson’s (1992) seminal book 
A-Morphous Morphology. In this section, Anderson compares two models of morphol- 
ogy — item-and-arrangement and item-and-process (attributed to Hockett 1954) — and 
argues in favor of the latter. Taking apophony (or ablaut; e.g. sing — sang) as one of the 
many problems encountered with the item-and-arrangement model, Anderson claims 
that “what presents {Past} in sang is ... the relation between sang and sing, expressed as 
the process by which one is formed from the other" (Anderson 1992: 62; emphasis orig- 
inal). The process in this case is replacement (or stem modification); "the PAsT form of 
sing is formed by replacing /1/ with /æ/? Crucially, /ze/ is not the morpheme designating 
PAST, and sang is not derived by combining bound morphemes, i.e. s-ņ and -æ-. 

The section which immediately follows in Anderson's book (83.5) is titled “Word- 
based vs. morpheme-based morphology". The issues addressed in these two sections 
are always considered together, since one is contingent upon the other. A root-based 
morphology is usually analyzed within the item-and-arrangement model. However, if 
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morphology is word-based, the debate between item-and-arrangement and item-and- 
process still holds (see §2). This debate is particularly heated in the study of Semitic 
morphology, where a consonantal root has been claimed to be the core morphological 
unit in the word. 

Paradigms like sing — sang are relatively rare in English, but abundant in Semitic 
languages, such as Hebrew, where the relation between words is often expressed with 
apophony; e.g. yam — yom ‘hot — heat’, limed — limud ‘to teach - learning’, fuman - 
fémen ‘fat — oil’, gadol -gódel ‘big — size’ (stress is final unless otherwise specified). 
Since item-and-arrangement has been the traditional approach to Semitic morphology, 
and has been supported by traditional Semiticists (see, however, §6) and generative lin- 
guists, Anderson contemplates whether sing — sang can be analyzed as a root s-ņ plus 
the markers Ju ‘PRESENT’ and /ze/ ‘past’. He, however, rejects this analysis due to the 
absence of "substantive evidence in its favor" (Anderson 1992: 62), and adds in paren- 
theses “as there clearly is ... for something like McCarthy’s analysis of Arabic and other 
Semitic languages" (ibid). That is, Anderson accepts the common view that item-and- 
arrangement is the appropriate model for Semitic morphology. 

While I support Anderson's approach to morphology, I do not agree with the exclusion 
of Semitic languages from the paradigm. On the basis of data from Modern Hebrew, I 
provide in this paper evidence supporting the word-based item-and-process (WoBIP) 
model for Semitic morphology. That is, English is not like Hebrew, but rather Hebrew is 
like English. 

In the context of Semitic morphology, I outline in the following $2 the possible mor- 
phological models that can be derived from the four different approaches: word-based, 
morpheme-based, item-and-process, and item-and-arrangement. Then, in 883-5 I pro- 
vide supporting evidence for the word-based item-and-process model, but due to space 
limitation, I do not dwell on arguments against competing models. Each piece of evi- 
dence supports only part of the model, but together we get a well-motivated model of 
morphology. Given Anderson's commitment to the history of linguistics (see, in particu- 
lar, Anderson 1985), I devote 86 to two principal Semiticists from the 19th century, whose 
grammar books support the word-based item-and-process model. Concluding remarks 
are given in 87. 


2 Models of morphology 


Research in morphology often concentrates on two questions: What is listed in the lex- 
icon and how are words derived? Each of these questions is associated with competing 
approaches. The what-question is related to the root-based vs. word-based debate, which 
is of particular interest in the study of Semitic morphology, where the root is always 
bound. The how-question is related to the item-and-process vs. item-and-arrangement 
debate. Together, they give rise to three models of morphology, shown in Figure 1: root- 
based item-and-arrangement, word-based item-and-arrangement, and word-based item- 
and-process. 
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Root-based Word-based 

What 
is listed Roots and configura- Words and configurations 
in the tions 
lexicon? 
H i. Extraction 

Ti Association of roots ii. Association of Imposing configura- 
Keng and configurations roots and con- tions on words 
derived? 


figurations 


Item & Arrangement Item & Process 


Figure 1: Models of morphology. 


In this paper I support the word-based item-and-process (WoBIP) model. Before dis- 
playing the supporting arguments, a short review of the three models is given in the 
three ensuing subsections. 


2.1 Root-based item-and-arrangement 


In the context of Semitic morphology, the root-based morphology teams up with item- 
and-arrangement. According to the traditional approach, the root in Hebrew and other 
Semitic languages consists of 2-4 consonants (3 in most cases) and is combined with a 
configuration (Bat-El 2011), where the latter, traditionally termed mishkal for nouns and 
binyan for verbs, is a shorthand for the grouping of prosodic structure, vocalic pattern, 
and affixes (if any).” In a configuration like miCCéCet, for example, the C-slots host the 
root consonants, the specified consonants (m and t) are affixes, and the vowels are part 
of the vocalic pattern (e.g. mivséfet ‘brush’, mizyélet ‘sleigh’, mifméset ‘guard’). Table 1 
shows examples of words sharing a root and examples of words sharing a configuration. 

The classical studies seem to suggest a lexical representation consisting of morphemes, 
as can be inferred from Moscati’s (1980: 71) account of the Semitic morphological system: 


1 I do not consider here the pluralistic approaches, whereby some words are derived from roots and others 
from words (McCarthy 1979; Arad 2005; Berman 2012), because all phenomena can be accounted for within 
the WoBIP model reviewed in §2.2. 

? Each of these elements (i.e. the prosodic structure, the vocalic pattern, and the affix) is independent (Mc- 
Carthy 1979; 1981), but here reference to the configuration suffices. In this context, we should note that 
the term “Semitic morphology" refers to morphology that employs configurations consisting of at least a 
vocalic pattern and prosodic structure. Of course, Hebrew and other Semitic languages employ the more 
conventional affixal morphology, but this type of morphology does not concern us here. 

3 Some words get additional, idiomatic meaning. For example, sidus carries the general meaning 'arrange- 
ment’ and the more specific one referring to ‘a prayer book’. Similarly, sédes carries the general meaning 
‘order’ and the more specific one referring to ‘Passover ceremony’ (sédeg pésay). 

^ As in other studies, the exponent of the 3 person masculine past serves as the citation form because it is 
structurally neutral, i.e. it has no affixes. The gloss is still in the infinitive, implying reference to the lexeme. 
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Table 1: Roots and configurations. 


Words sharing the root vsdg 


CiCeC Jadp ` sidex ‘to arrange’ 
CaCiC sadig ‘regular’ 

CiCuC sidux ‘arrangement’? 
CeCeC sédeg ‘order’ 
meCuCaC mesudag ‘arranged’ 
miCCaC misdax ‘military parade’ 
miCCeCon misderon ‘corridor’ 
CaCCan sadran ‘usher’ 

CiCCa sidga ‘series’ 


Words sharing a configuration? 


CaCCan Jsdg sadgan ‘usher’ 


Jgkd ` pakdan ‘dancer’ 


Jbtl — batlan ‘lazy’ 
CeCeC vbgd  béged ‘garment’ 

Jil jéled ‘boy’ 

vdgl dégel ‘flag’ 
CiCeC Jupe  xipes ‘to search’ 

Jbtl ` bitel ‘to cancel’ 

vxbs  xiber ‘to connect’ 


“The Semitic languages present a system of consonantal roots (mostly triconsonantal), 
each of which is associated with a basic meaning range common to all members of that 
root: e.g. ktb ‘to write’, qbr ‘to bury’, grb ‘to approach’, etc. These roots (root morphemes) 
constitute a fundamental category of lexical morphemes.” If roots are listed, so are the 
configurations, and word formation thus consists of associating roots and configurations, 
i.e. item-and-arrangement. 

As Hoberman (2006: 139) notes, "students of Semitic languages find the concept of the 
root so convenient and useful that one finds it hard to think about Semitic morphology 
without it? However, researchers vary with respect to the definition of the term “root”. 
Lipiński (1997: 202), for example, assumes that “Semitic roots are continuous morphemes 
which are instrumental in derivation but subject to vocalic and consonantal change ... 
based on continuous or discontinuous ‘pattern morphemes”’ (emphasis original). The 
"continuous morphemes”, which Lipiński calls roots, are not the traditional consonantal 


ER 


roots, but rather stems consisting of vowels and consonants; the “pattern morphemes” 
are what I call configurations. Aronoff (2007) drains the original morphological (struc- 
tural and semantic) properties from the root, claiming that it does not have to be linked 
to meaning and its phonology can be vague. Yet another use of the term “root” is found 
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in Frost, Forster & Deutsch (1997) with reference to an orthographic root, which as the 
results of their experiments suggest, has no semantic properties. 


2.2 Word-based item-and-process (WoBIP) 


Within this approach, the word or the stem is the core element to which all the required 
processes apply (Aronoff 1976). As a core element, it does not have an internal morpho- 
logical structure. The processes are operations (Anderson 1992: 72) that modify the basic 
form (Matthews 1974: 97). Indeed, the most common process in languages is the one 
deriving bats from bat, i.e. affixation, but there are other processes, such as apophony, 
which derives teeth from tooth. 

Also in the context of Semitic morphology, the input is a word/stem to which several 
processes apply (see §3.1.2 for word vs. stem as the base). The processes vary accord- 
ing to the goal, and the goal is that the output fits into a configuration. Such a goal- 
or output-oriented phenomenon, called stem modification (Steriade 1988; McCarthy & 
Prince 1990), is best analyzed within the framework of Optimality Theory (Prince & 
Smolensky 1993/2004), as shown in analyses of Semitic morphology, such as McCarthy 
& Prince (1993); Ussishkin (1999; 2000); Gafos (2003); Bat-El (2003). 

The details of the required modification depend on the structural similarity between 
the base and the output; the more similar they are, the fewer the required adjustments. 
Any element in the configuration can be modified - the vocalic pattern, the prosodic 
structure, and/or the affix. The modification, however, is contingent upon the configu- 
ration of the output. 


Table 2: Stem modification - modifying elements in the configuration. 


Base form Derived form Modified elements 
sabon ‘soap’ —  siben ‘to soap’ vocalic pattern 

tipel ‘to take care of — me-tapel ‘caretaker’ vocalic pattern, affix 
matok ‘sweet’ — ma-mtak ‘candy’ vocalic pattern, affix, 


prosodic structure 


Within this approach, there is no morphological element consisting solely of three 
consonants, and the emphasis here is on a “morphological element”. Of course, related 
words share consonants, but these are stem consonants, where the stem is a morphologi- 
cal unit (e.g. tapél in me-tapel ‘caretaker’), but the consonants are phonological elements. 


2.3 Word-based item-and-arrangement 


Item-and-arrangement can also be applied within the word-based approach, but only if 
a root is extracted from the base word (Ornan 1983; Bolozky 1978). That is, the base is 
the word but the root is an intermediate morphological element in the derivation. The 
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derivation proceeds in two stages — extraction and association (Bat-El 1986; 1989). For 
example, the word sabón ‘soap’ serves as the base for the verb sibén ‘to soap’, which is 
derived in two stages: (i) extraction of the consonants s,b,n, which automatically become 
the root J/sbn (traditionally called a secondary root), and (ii) association of this newly 
formed root with the verb configuration CiCeC. The assumption is that the extracted 
consonants carry the semantic properties of their base, which are, in turn, carried over 
to the derived form. 

However, root extraction is necessary only when one is limited to the root-based ap- 
proach, and thus to item-and-arrangement. In this model, all words are derived via as- 
sociation of a root with a configuration, regardless of whether the base is a word or a 
root. Not only is there no independent reason to prefer root extraction to stem modifica- 
tion (§2.2), but also there is empirical evidence refuting root extraction. These are cases 
of phonological transfer (§3.1), whereby properties that cannot be carried over by the 
consonants are transferred from the base to the derived form. 


3 Phonological and morphological relations 


3.1 Transfer of phonological structure 


The most striking evidence for a direct relation between words, without an intermediate 
stage that derives a root, is provided by cases exhibiting phonological transfer (Clements 
1985; Hammond 1988; McCarthy & Prince 1990). As shown below, there are cases where 
structural information, which cannot be encoded in the consonantal root, is transferred 
from the base to the derived form. In the case of Hebrew, the structural information is 
both prosodic and segmental (Bat-El 1994). 


3.1.1 Prosodic transfer 


Prosodic transfer includes transfer of the entire configuration or of a consonant cluster. 

Configurations are often assigned a grammatical function (Doron 2003), but the ques- 
tion is whether this grammatical function is a property of the configuration or just a 
property shared by many (but not all) words within a morphological class. In general, 
words that share meaning are often structurally similar, but it does not necessarily mean 
that this shared meaning is a property of a morphological unit. One striking example is 
displayed by the nouns in Table 3 below, most of which are creative innovations (drawn 
from http://www.dorbanot.com). These nouns share the configuration CoCCa and the 
meaning ‘related to a computer program’. 

Since these nouns share a configuration and meaning, the traditional Semitic morphol- 
ogy would assign the meaning to the configuration. This is, of course, erroneous because 
there are other nouns with the configuration CoCCa that do not carry this meaning; e.g. 
jofga ‘dignity’ (cf. jafag ‘honest’), yoyma ‘wisdom’ (cf. yayam ‘smart’), ofsma ‘strength’ 
(cf. afsum ‘huge’), jozma ‘enterprise’ (cf. jazam ‘to initiate’). In addition, this meaning is 
too specific to function as a morpho-semantic feature. 
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Table 3: Nouns sharing a configuration. 


CoCCa noun Related word 

toyna ‘computer program’ toynit ‘program’ 
gonva ‘stolen computer program’ ganav ‘to steal’ 
porna ‘computer program with porno pop-ups’ pogna ‘pornography’ 
fsogva ‘illegally burned computer program’ fSarav ‘to burn’ 
gomla ‘old computer program’ gimlaot ‘pension’ 


What we actually have here is a Semitic-type blending. The last four words in the first 
column of Table 3 use the first word toyna as a base form, from which the configuration 
is drawn, along with the basic meaning. That is, toyna provides the configuration CoCCa 
and the meaning ‘relating to a computer program’. The stem consonants are drawn 
from the related words in the third column of Table 3, along with some specific meaning 
denoted by this word. Crucially, such a derivation must be word-based, and the fact that 
these words are creative innovations suggests that this model is active in the Hebrew 
speakers’ grammar. 

Other creative examples are found in a children’s story written by Meir Shalev (soni 
venomi vehadov jaakov ‘Roni and Nomi and the bear Jacob’). Each invented word in the 
first column of Table 4 has two bases, one providing the configuration and another the 
consonants. 


Table 4: Meir Shalev’s invented words. 


Invented word Source of configuration Source of consonants 

koféfet ‘she wears lovéfet ‘she wears’ kfafot ‘gloves’ 
gloves’ 

mogéfet ‘she puts on noélet ‘she puts on magaf ‘boot’ 
boots’ shoes’ 

lehitmahes ‘to hurry/ lehizdarez ‘to hurry’ lemahes ‘to rush’ 
rush’ 

layuts ‘to run out’ lagufs ‘to run’ hayufsa ‘outside’ 


Given that the invented words draw semantic properties from the two base words, as 
is usually the case with blends, direct access to the base must be assumed. That is, the 
configuration of one of the base words is imposed on the other. 

Cluster transfer is often found in denominative verbs like transfer — tginsfeg ‘to trans- 
fer' and faks — fikses 'to fax' (Bolozky 1978; McCarthy 1984; Bat-El 1994). In such cases, 
the distribution of the sequential order of vowels and consonants, thus including the 
clusters, is preserved in the derived form. For example, filter ‘filter’ is the base of the 
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verb filter (preserved cluster — It), while flist ‘flirt’ is the base of flistet (preserved clus- 
ters — fl, st), and not *fileet. Note that the higher the structural similarity between the 
base and the derived form, the closer the semantic relation (Raffelsiefen 1993), and thus, 
the fewer the structural amendments required in the course of stem modification (82.2), 
the greater the semantic similarity. 


3.1.2 Segmental transfer 


Segmental transfer includes vowel transfer as well as the transfer of an affix consonant 
to the stem (Bat-El 1994). 

In vowel transfer, an exceptional configuration is selected because its vowel is identical 
to that of the base (e.g. kod ‘code’ — koded ‘to codify’, ot ‘sign’ — otet ‘to signal’). It 
should be noted that in most cases, the regular configuration is also possible (e.g. kided 
‘to codify’). However, the exceptional configuration is used only when the base has an 
o. That is, there is output-output correspondence between the base noun kod and the 
derived form, and koded is segmentally more faithful to kod then kided (Bat-El 2003). 

In affix transfer, the consonant that serves as an affix in the base becomes a stem 
consonant in the derived form. This is common with the suffix -n, as in pasfan 'com- 
mentator’ — pisfen ‘to commentate’ (cf. peref ‘to interpret’) and the prefix m-, as in 
mayzok ‘cycle’ — miyzek ‘to recycle’ (cf. yazag ‘to return’). Note that speakers’ mor- 
phological knowledge allows them to strip the word of its affixes (more so in regular 
forms), and therefore the inclusion of an affix consonant in the derived words has its 
purpose, mostly to preserve a semantic contrast, as in yizeg ‘to court’ vs. miyzek ‘to re- 
cycle’ (from mayzosg ‘cycle’). But in the paradigm of famar ‘to guard’ - mifmap ‘guard’ 
there is no *mifmex (though it is a potential verb). 


3.2 Semantic distance 


One crucial property distinguishing among the three approaches reviewed in §3 is the 
semantic “distance” between related words; among these, only the WoBIP model (1c) 
allows a direct relation between a base and its derived form. 


(1) The distance factor 


a. Root-based item-and-process 


J/sdx 


D EAM 


sidéx sédex 
to arrange order 
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b. Word-based item-and-process 


sédex J/sdx 
‘order’ 
sidéx 
‘to arrange’ 


c. Word-based item-and-arrangement 


sédex sidéx 
order’ ‘to arrange 


The advantage of the direct relation (1c) is that information can be carried over from 
input to output, be it structural (§3.1) or semantic. It is often the case that within a 
group of words sharing stem consonants, there is 1st, 2nd or higher degree of separation 
between words, as illustrated in Figure 2. 

Such a network can express different degrees of semantic relations, depending on how 
far one word is from another. Needless to say, such a network cannot be expressed if 
all words are derived from a single root. Of course, one can claim that the three words 
at the middle of the network (takdím, kidómet, and mikdama), which are not directly 
related to one another, are derived from a root, while all other words are derived from 
words (McCarthy 1979; Arad 2005). However, this is an unsupported and unnecessary 
burden on the system. All words in the network are connected to one another, directly 
or indirectly, where some words are basic and others are derived. The fact that all the 
words in Figure 2 share the stem consonants is due to the important role of consonants 
in conveying lexical information and lexical relations (see §5.2). 


3.3 Derivation without a configuration 


A fundamental element of the traditional root-based item-and-arrangement model is that 
every word consists of a root and a configuration, where every configuration is a func- 
tion. This is particularly essential in the verbal paradigms, where the configurations are 
claimed to carry grammatical categories, such as transitivity (Doron 2003; Arad 2005). 
Such a theory predicts that the transitivity relation must involve a change in the config- 
uration. This is true for most cases (e.g. katáv ‘to write’ — hitkatév ‘to correspond’, falay 
‘to send’ - niflay ‘to be sent’, layafs ‘to press’ — hilyifs ‘to cause to feel pressured’), but 
not all. 

In an extensive study of labile alternations in Hebrew, Lev (2016: 114-115) lists 91 verbs 
where transitivity does not involve a change of the configuration; three of his examples 
are provided in Table 5. As Lev argues, a root-based morphology cannot accommodate 
labile verbs because under this approach the root has to associate with two different con- 
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kédem 
ancient time 
kadim 
ancient 


takdim 
‘precedent’ 
kódem kidómet 
‘before’ ‘prefix’ 
SE mikdama 
x à advanced 
early Y 
payment 


hikdím mm 
‘to be early’ mm 
kidmí kidmá kidém 
‘in front’ ‘progress’ ‘to promote’ 
Eg 


‘to Eg 


kadíma 
'ahead' 
Figure 2: Degrees of separation. 


figurations in order to derive verbs contrasting in transitivity. The word-based approach, 
on the other hand, can incorporate labile verbs, assuming that transitivity in such verbs 
is not lexically specified but rather derived from the syntactic context. That is, some 
verbs are specified for [+ transitive] and others, i.e. the labile verbs, are [o transitive]. 
Many of the examples in Lev's list are recent innovations, i.e. where verbs with transi- 
tivity specification become labile. For example, the verb tijel used to have one meaning 
only, ‘to walk’, but today it also means (at least for some speakers) ‘to walk someone 
(usually a dog)’. This change can be viewed as a loss of transitivity specification, i.e. [- 
transitive] >[o transitive]. Crucially, it is the verb that loses its specification for transi- 
tivity, not the configuration. That is, in historical change too, as shown in the ensuing 
84, it is the word that changes, and not some putative consonantal root. 
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Table 5: Labile verbs (Lev 2016: 114-115). 


Verb Transitive Intransitive 


hifyis ‘to make black’ ‘to become black’ 
hivrik ‘to polish’ ‘to shine’ 
hivri ‘to cure ‘to recuperate’ 


4 Historical change 


4.1 Configuration change 


Over the course of time, words may change their meaning or their structure. In his study 
of instrumental nouns in Hebrew, Laks (2015) shows that quite a few instrumental nouns 
undergo change in their configuration, in particular within a compound, as illustrated 
in Table 6. As Laks shows, the change always goes towards the participial configuration, 
and it never occurs when the instrumental noun does not have a verbal counterpart. That 
is, while both mayded ‘pencil sharpener’ and mazleg ‘fork’ have the same configuration, 
only the former adopts the participial configuration meyaded, given its verbal counter- 
part yided ‘to sharpen’; the latter cannot adopt a participial configuration because it does 
not have a verbal counterpart. 


Table 6: Change of configuration in instrumental nouns (Laks 2015). 


Old configuration New configuration - participle 

mayded | maCCeC  meyaded meCaCeC ‘pencil sharpener’ 
nakdan CaCCan menaked (tekstim) meCaCeC ‘text vocalizer’ 
masyeta maCCeCa  soyet (mifsim) CoCeC ‘juicer (juice squeezer)’ 


In order for this restriction to hold, an instrumental noun must be linked to its verb, 
from which it can draw its participial configuration. Otherwise, as Laks argues, the in- 
strumental noun could adopt any of the five participial configurations, not necessarily 
the one associated with its verb. That is, we get the instrumental noun meyaded ‘pen- 
cil sharpener’ because meCaCeC is the participial configuration of yided ‘to sharpen’. 
Similarly, we get the instrumental nouns soyet (mifsim) ‘juicer’ because CoCeC is the 
participial configuration of sayat ‘to squeeze’. Such a change is possible only in a word- 
based lexicon; a root-based lexicon does not account for the restrictive generalization as 
it allows options that are not attested. 
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4.2 Semantic change 


Over the course of time, the meaning of words also changes; crucially, the semantic 
change affects words and not putative roots. For example, the verbs nimlat and himlit 
used to differ in transitivity only, with the former meaning ‘to escape’ and the latter 
‘to help someone to escape’. Nowadays, these verbs are not related, since himlit means 
‘to give birth (for non-humans)’. Similarly, kalat and hiklit used to be related, with the 
former meaning ‘to absorb’ and the latter ‘to cause to absorb’. However, the meaning of 
hiklit is now ‘to record’, and the two verbs are vaguely related, if at all. For the traditional 
root-based approach (§2.1), it would be rather strange that the change in meaning does 
not affect the element that carries it, i.e. the root. This inconsistency does not arise within 
the word-based approach. 

It is quite feasible that the root does not undergo semantic change because its mean- 
ing is just “a basic meaning range”, according to Moscati (1980) and other Semiticists, 
or underspecified, according to Arad’s (2005) analysis within the theory of Distributed 
Morphology (Halle & Marantz 1993 and subsequent studies). That is, semantic specifica- 
tion of roots may have at least three degrees of specification: fully specified (e.g. boy), 
underspecified (e.g. Hebrew roots), and unspecified (e.g. the roots in refer, remit, and 
resume; Aronoff 1976). 

The major problem is that the specific meaning of words is derived, according to Arad 
(2005), from the morpho-syntactic context, ie. the configurations. This works nicely 
for some words but not for others. Consider, for example, the pairs zasak ‘to throw’ - 
hizwik ‘to inject’ and mafay ‘to pull’ - himfiy "to continue’. It is not clear which semantic 
property can be assigned to the configurations CaCaC and hiCCiC such that the relation 
within these pairs would be consistent. 


4.3 Segmental change 


Like semantic change, segmental change also affects words and not consonantal roots, 
even when the change is in the stem consonants. This is seen in the case of stop-fricative 
alternation, which due to its opacity, suffers from a great degree of change-oriented 
variation (Adam 2002). 

As shown in Table 7, normative verb inflectional paradigms are changing under the 
force of paradigm uniformity. Although the change affects consonants, it certainly does 
not affect a consonantal root because derivationally related words are hardly ever af- 
fected; nonetheless they change, and sometimes they even undergo independent change. 
For example, while y can change to k in katav - jiktov (normative jiytov) ‘to write PAST 
— FUTURE’, it never changes to k in miytav (*“miktav) ‘letter’. Note also that while the 
direction of change in this paradigm is from a fricative to a stop (Jiytov — jiktov), the 
change in a related pair is towards a fricative, as in ktav — ytav ‘handwriting’. 
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Table 7: Change of configuration in instrumental nouns (Adam 2002). 


Old paradigm New paradigm 


k-y kisa  jexase y-x xisa  jeyase ‘to cover PAST — FUTURE’ 
k-x  katav  jixXtov k-k  katav jiktov ‘to write PAST — FUTURE’ 
b-v  bitel jevatel v-v vitel  jevatel ‘to cancel PAST - FUTURE’ 


5 Other supporting sources 


5.1 Children's words 


During the early stages of acquisition, verbs in the production lexicon of children ac- 
quiring Hebrew are not derivationally related, i.e. they do not share stem consonants. 
Derivationally related verbs start appearing later on, where the new verbs "are learnt as 
versions of, and based upon, the verbs known from before" (Berman 1988: 62). 

This direct derivation in children's speech is not surprising given Ravid et al.'s (2016) 
study on the distribution of verbs in spoken and written Hebrew corpora: child-directed 
speech (to toddlers age 1;8-2;2) and storybooks (for preschoolers and 1st-2nd grade). In 
both corpora, the average number of verbs per root was below two: 684 verbs for 521 
root types in the spoken corpus (1.3) and 1,048 verbs for 744 roots in the written corpus 
(1.4). Only around 30* of the verb types in each corpus share a root with another verb, 
and most such verbs share a root with only one other verb. 

These results mean, as the authors admit, that at least until the age of 7, the children 
have very little input supporting a root-based morphology. Nevertheless, the authors 
insist that the children must “eventually construe the root as a structural and seman- 
tic morphological core" (Ravid et al. 2016: 126). As argued in the current paper and 
elsewhere, starting with Bat-El (1994), Hebrew speakers are free from this burden since 
Hebrew morphology (and Semitic morphology in general) is not root-based, but rather 
word/stem-based. 

Previous studies that attribute children and adults' innovations to root extraction (82.3 
— word-based item-and-arrangement) must now reconsider their conclusion at least for 
children below the age of 7. In an experimental study reported in Berman (2003), children 
at the age of 4-6 years old had a rather high success rate (84-88%) of morphological 
innovation (forming novel verbs from nouns or adjectives) with a very high percentage 
of well-formed innovations (91-99%). If children can form verbs from nouns/adjectives 
at the stage where they still do not have sufficient input that allows them to form a 
root-based morphology (Ravid et al. 2016), they probably use another strategy - the 
modification strategy employed within the WoBIP model (82.3). And if they can use 
this model successfully until the age of 7, they have no reason whatsoever to shift to a 
root-based model later on. Of course, as I have argued here and elsewhere, they do not 
- Hebrew speakers employ WoBIP at all ages. 
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5.2 Experimental studies 


There are quite a few experimental studies supporting the consonantal root in Hebrew. 
Most notable are Berent’s studies with the acceptability rating paradigm (Berent & Shim- 
ron 1997; Berent, Everett & Shimron 2001, inter alia) and Frost’s studies with the priming 
paradigm (Frost, Forster & Deutsch 1997; Frost et al. 2000, inter alia)? However, most 
experimental studies supporting the consonantal root in Hebrew morphology adopted 
a visual modality. As such, they cannot tease apart the effect of orthography, which is 
primarily consonantal (Bat-El 2002 and Berrebi 2017 for a critical view). 

A fresh look on the matter is provided in Berrebi’s (2017) auditory priming study, 
which controlled semantic relatedness and orthographic identity. Word pairs sharing 
phonological stem consonants were either semantically related (e.g. kibel ‘to receive’ - 
hitkabel ‘to be accepted’) or semantically unrelated (e.g. gigel ‘to spy’ — hitragel ‘to get 
used to’); and when semantically unrelated, either orthographically identical with re- 
spect to the consonants (e.g. gigel ‘to spy’ - hitsagel ‘to get used to’) or orthographically 
different (e.g. Jikeg ‘to lie’ — hiftakew ‘to get drunk’, where k is spelled differently). The 
results showed that all conditions had a priming effect, i.e. whether or not the prime 
and the target were orthographically identical or semantically related. As the property 
shared by the prime and the target in all conditions was phonological, i.e. stem con- 
sonants, the results suggest that there is a phonological priming effect among words 
sharing stem consonants. Crucially, the stem consonants are not a morphological unit 
since there was also a priming effect when the words were semantically unrelated and 
orthographically different (e.g. (ker — hiftakex). 

If we assume that priming effects reflect the organization of the lexicon, then we can 
conclude that words are also phonologically organized according to the stem consonant. 
As emphasized in $2.2, the stem consonants are phonological elements (consonants) 
within a morphological unit (stem); they do not carry meaning and they do not con- 
stitute a morphological unit. 

Stem consonants, and not vowels, serve to identify relations between words because 
consonants are lexically prominent, while vowels have syntactic functions (Nespor, Pefia 
& Mehler 2003; Berent 2013); this is true not only for Hebrew but also for non-Semitic 
languages. In their experimental study, Cutler et al. (2000) asked the participants: “Is a 
kebra more like cobra or zebra?". They found that speakers identify similarity between 
a nonce word (kebra) and an existing word on the basis of shared consonants (kebra - 
cobra) rather than shared vowels (kebra — zebra). That is, the consonants serve as the 
core of similarity between words in English, French, Swedish, and Dutch as much as 
they do in Hebrew and other Semitic languages (see also Ooijen 1996; New, Araujo & 
Nazzi 2008; Carreiras & Molinaro 2009; Winskel & Perea 2013). 

Consonants are lexically prominent from the very early stages of language develop- 
ment. This is reported in Nazzi & New's (Nazzi & New) study, where French 16-20 month 
old infants could learn in a single trial two new nonce words if they differed by one con- 


5 In an additional study, which was design to ask “is it a root or a stem?" (rather than “is it a root?”), Berent, 
Vaknin & Marcus (2007) note that although their results do not falsify the root-based account, they strongly 
suggest that the stem can account for the restrictions on identical consonants. 
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sonant (pize — tize), but not if they differed by one vowel (pize — paze). That is, although 
vowels are acoustically more prominent than consonants, when it comes to lexical con- 
trast, consonants are employed. This is true for children and adults, regardless of the 
ambient language, whether it is Semitic or non-Semitic. 

Consonants are prominent not only in speech perception and lexical relations but also 
in the association between sound and shape revealed by the bouba-kiki effect (Köhler 
1929), whereby people pair labial consonants with round shapes and dorsal consonants 
with spiky shapes. One of the many subsequent studies of the bouba-kiki effect is Fort, 
Martin & Peperkamp (2015), which found that the sound-shape association remains con- 
stant regardless of the vowels. That is, lomo was associated with a round shape as much 
as limi, and toko with a spiky shape as much as tiki. Fort, Martin & Peperkamp (2015) 
conclude that consonants have a greater effect than vowels in sound - shape association. 


6 19th century Semitic grammarians 


The root-based item-and-arrangement model of Semitic morphology has been deeply 
entrenched for generations, thus presenting the advocates of the word-based item-and- 
process approach as revolutionary (see Horvath 1981; Lederman 1982; Heath 1987; Ham- 
mond 1988; McCarthy & Prince 1990; Bat-El 1994; 2002; 2003; Ratcliffe 1997; Ussishkin 
1999; 2000; 2005; Laks 2011; 2015; Lev 2016). 

However, WoBIP has its seeds in the studies of the orientalists Wilhelm Gesenius 
(1786-1842) and William Wright (1830-1889), who wrote the seminal grammar books of 
Hebrew and Arabic respectively. It is important to note that both Gesenius and Wright 
were not native speakers of a Semitic language (Gesenius was German and Wright 
British), and thus not biased like the other Semitic grammarians by the consonantal 
script of Hebrew and Arabic. 

Gesenius (1813) distinguishes between "primitive" verbs, which consist of a stem only 
and are not derived from any other form, and derived verbs, among which there are ver- 
bal derivatives and denominative verbs. Gesenius used the term "internal modification" 
when addressing the processes involved in the derivation. He indicates two types of 
"changes in the primitive form" (Gesenius 1813: 115): internal modification (cf. stem mod- 
ification; $2.2) and repetition (i.e. reduplication) of one or two of the stem consonants. 
Within the internal modification, he includes vowel change like gadal ‘to grow’ — gidel 
‘to raise’, and gemination as in Biblical Hebrew ga:dal ‘to grow’ — giddel ‘to raise’ (there 
is no gemination in Modern Hebrew). Crucially, Gesenius compares vowel modification 
in Hebrew to that in English lie — lay and fall — fell, and does not find them different. 
That is, Gesenius finds stem modification to be identical in both Hebrew and English, 
but unlike Anderson (1992) who contemplates whether English is like Hebrew, Gesenius 
actually claims that Hebrew is like English. 

A similar approach is found in Wright's (1859) grammar book of Arabic, where he 
describes the relation between verbs within the WoBIP model. For example, "the third 
form ... is formed from the first by lengthening the vowel sound after the first radical" (p. 
32) or “[T]he second form ... is formed from the first ... by doubling the second radical" 
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(p. 31). This is the format of Wright’s description of each and every binyan in Arabic, and 
it specifically says that (i) one form is derived from another, i.e. word-based derivation, 
and (ii) the derivation involves some process, doubling, lengthening, etc., i.e. item-and- 
process. Note that Wright uses the term “radical” to refer to a consonant in the stem, 
without reference to the stem consonants being an independent morphological unit. 

That is, although it has always been said that the root-based approach is the one as- 
sumed by traditional Semiticists, it is important to emphasize that the two great 19th cen- 
tury Semiticists, Gesenius and Wright, were proponents of the WoBIP model of Semitic 
morphology.° 


7 Concluding remarks 


In §3.6, Anderson (1992: 71) concludes: “... the morphology of a language consists of a 
set of Word Formation Rules which operates on lexical stems to produce other lexical 
stems ...” In this paper I extended the scope of this model to Semitic morphology. That 
is, in Semitic languages too, words are derived from words/stems via modification of the 
base. 

The modification in Semitic morphology is output oriented, as the output has to fit 
into a configuration. The constraint-based framework of Optimality Theory (Prince & 
Smolensky 1993) allows for output-oriented grammar, where the constraints impose cer- 
tain configurations (structural constraints) as well as output-output identity of conso- 
nants (faithfulness constraints). A configuration is imposed by several constraints, refer- 
ring to syllabic structure (usually a foot), syllable structure, and vocalic patterns (where 
the latter ones are language specific). Identity among the stem consonants is imposed by 
the faithfulness constraints, where preservation of segmental identity ensures preserva- 
tion of morphological relation among words. 

That is, the stability of the stem consonants is due to phonological faithfulness con- 
straints that require identity among stem consonants. Phonological faithfulness en- 
hances morphological relations. “Any given focal word (that is, a specific word in which 
we are interested) is thus surrounded by a vaguely defined family of words which are 
more or less acoustically similar to it. The members of the family will in general have the 
widest variety of meaning, and yet it may often happen that some members of the family 
will resemble the focal word not only in acoustic shape, but also in meaning” (Hockett 
1958: 297, 1987: 86). That is, within an acoustic family of words there is a morphological 


6 A reviewer suggested that Gesenius and Wright adopted the word-based approach, which was used for 
Latin grammar, because they worked prior to the introduction of the term morpheme. Kilbury (1976) and 
Anderson (1985) attribute the term morpheme to Baudouin de Courtenay's student H. Ułaszyn, in his ar- 
ticles from 1927 and 1931. However, it is possible to have a notion without a specific term. Sibawayhi 
(760—796), who wrote the first known Arabic grammar Al-kitab, used the term kalima ‘word’ in the sense 
of a morpheme (e.g. the suffix -ta) and referred to the radicals that make up the words (Levin 1986). Gese- 
nius notes that the Jewish grammarians call the stem root and the stem consonants radical letters. That is, 
there was a reference to morphological units (stem, affixes), but the stem consonants did not constitute a 
morphological unit. 
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family, where the words are not only acoustically similar but also semantically related. 
For the purpose of membership in a morphological family, the consonants are more im- 
portant than the vowels. This status does not grant the consonants morphological status, 
neither in English nor in Hebrew. 
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Root-based syntax and Japanese 
derivational morphology 


Brent de Chene 


Waseda University 


This paper argues that the formation of transitive and intransitive verb stems in Japanese, a 
process that has been widely seen as supporting the Distributed Morphology view of deriva- 
tional stem-formation as performed by the syntax, cannot in fact be analyzed as syntactic. 
The Japanese data are thus consistent with Anderson’s (1982) claim that it is precisely that 
morphology traditionally classified as inflectional that reflects syntactic operations. 


1 Introduction 


In a well-known paper, Anderson (1982: 587) proposes that “Inflectional morphology 
is what is relevant to the syntax,’ where syntactically relevant properties are those “as- 
signed to words by principles which make essential reference to larger syntactic struc- 
tures.” He claims further that a delimitation of inflection on this basis closely mirrors 
the traditional understanding of where the boundary between inflection and derivation 
lies. In contrast, the Distributed Morphology literature, in treating syntax as root-based 
and stem formation of all types as syntactic, denies significance to the traditional dis- 
tinction between inflection and derivation and renders vacuous the claim that inflection 
is just that portion of morphology that realizes elements and properties manipulated 
by the syntax.! The present paper takes up the formation of transitive and intransitive 
verb stems in Japanese, a case that has been widely seen as supporting the DM view of 
stem-formation as performed by the syntax, and argues that a closer look reveals that 
the derivational processes in question cannot in fact be analyzed as syntactic. In the 


! The founding paper of the DM framework, Halle & Marantz (1993), presents DM as a theory of inflection 
and makes no explicit claims about derivation, but the adoption of root-based syntax and the rejection 
of the inflection/derivation distinction are clear at least by Marantz (1997; 2001). See below for further 
references. 
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end, then, the Japanese data is consistent with Anderson’s view that there is a funda- 
mental distinction between inflection and derivation and that the criterion of syntactic 
relevance picks out just that morphology traditionally classified as inflectional.” 

In recent years, the derivational morphology of the Japanese verb has become a stan- 
dard example (as in Harley 2012) illustrating the DM claim that syntax is root-based - the 
claim, that is, that along with functional morphemes, the atoms of syntactic computation 
are roots rather than (inflectable) stems or (inflected) words (Embick & Marantz 2008: 5). 
In particular, it has become widely accepted (Marantz 2013: 106) that the Japanese suf- 
fixes that create transitive and intransitive verb stems are instances of little v, causative 
and inchoative, that attach to roots and thus that the verb stems themselves are syntactic 
constructions — much like, say, the combination of a verb stem with a tense element or 
a main verb with an auxiliary. Here, I note first that these claims about the constituency 
of Japanese verb stems rest on a restricted database that masks the fact that a signifi- 
cant number of stems involve sequences of two transitivity-determining suffixes. I then 
present the failure of two nested suffixes to interact in the way expected of syntactic 
elements — in particular, the fact that an inner suffix must be taken as invisible for pur- 
poses of semantic interpretation and argument structure - as the first of several related 
arguments casting doubt on the proposal to generate Japanese verb stems syntactically. 

The data on which DM theorists base their claim that the verbal derivational suffixes 
of Japanese are instances of little v attaching to roots is the appendix of Jacobsen (1992), 
which represents a light revision of the appendix of Jacobsen (1982), and in turn appears 
lightly revised as Appendix I in Volpe (2005). That appendix consists of roughly 350 pairs 
of isoradical intransitive and transitive verbs presented in their citation forms (Imper- 
fect/Nonpast Conclusive) and sorted into sixteen classes depending on the derivational 
suffixes that appear at the right edge of their stems. The fact that the Jacobsen/Volpe 
appendix is limited to verb stems presented pairwise means that using it as a basis for 
the identification of root requires assuming for each transitivity pair that there are nei- 
ther stems of other lexical categories nor verb stems outside the transitivity pair that 
provide information about the relevant root. $2 below, in the context of presenting back- 
ground information on Japanese derivation, introduces a number of cases in which this 
assumption is unjustified. The following three sections, building on the observations of 
82, present reasons for doubting that verb stems are syntactically derived. While for 
concreteness I refer throughout to the DM literature cited above and related work, the 
argumentation is intended to apply to any proposal to generate Japanese verb stems 
syntactically. 

83, first, shows that a substantial minority of verb stems involve two transitivizing (T) 
or intransitivizing (I) suffixes (with the four orders TT, TI, IT, II all attested), but that 
an outer suffix must be taken to render an inner one null and void for purposes of argu- 
ment structure and semantic interpretation. $4 shows that the same is true for the suffix 


? On a personal note, while I have taken the idea that inflection is precisely the syntactically relevant morph- 
ology as a guiding principle for many years, it was anything but obvious to me at the time Steve proposed 
it. It ranks high in my personal inventory of the many things I have learned from Steve, and I am happy 
to have this opportunity to reaffirm it in a volume dedicated to him. 
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pair -m- (verbal) and -si- (adjectival), with the additional complication that the order in 
which those two suffixes appear relative to a root R is an idiosyncratic function of R. §5, 
finally, argues against a syntactic account of stem formation on the basis of semantic 
change, claiming, for lexical causatives in particular, that the diachronic instability of 
the putatively compositional causative interpretation (much as if a phrase like kick the 
bucket were to lose its compositional interpretation, retaining only the idiosyncratic one) 
shows that that interpretation cannot have been based on a syntactic derivation in the 
first place. In all of these cases, the behavior of the derivational suffixes under consid- 
eration is contrasted with that of inflectional and uncontroversially syntactic elements. 
§6, a brief conclusion, sketches two possible non-syntactic approaches to derivational 
morphology and speaker knowledge thereof and suggests that the choice between them 
for cases like the one considered here remains a topic for further research. 


2 Background 


In considering the shortcomings of Jacobsen’s (1982; 1992) appendix as a database for 
Japanese verbal derivation, the first thing to note is that the pairwise presentation of the 
data does not always adequately represent the relations of isoradicality that hold among 
verb stems. This is because a number of roots underlie three or (in at least one case) four 
verb stems rather than two; in such cases, Jacobsen either lists two pairs in separate 
places or, as we will see below, omits one of the stems. In several cases involving three 
stems on a single root, there are two pairs of stems differentiated at least roughly by root 
alloseme, with a formal contrast for either transitives or intransitives but not both. For 
example, the difference between the allosemes 'solve' and 'dissolve, melt' of the root tok- 
corresponds to a formal distinction for transitives but not for intransitives, as shown in 


(1) and (2)? 


(1) a. tok-e- ‘be solved’ 
b. tok- ‘solve’ 
(2) a. tok-e- ‘melt (i) 


b. tok-as- ‘melt (t) 


In other cases, as in (3) and (4), there is no alloseme-dependent pairing, simply a triplet 
of isoradical stems. 
(3) a. tunag-ar- 'be connected' 
b. tunag-e- ‘connect (t)’ 


c. tunag- ‘connect (t) 


3 Below, taking the distinction between inflection and derivation in Japanese to be uncontroversial, I use 
stem in the traditional meaning *morpheme (sequence) subject to inflection" and cite bare stems rather 
than inflected forms; “(i)” and "(t)" in glosses indicate intransitive and transitive meanings, respectively. 
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(4) a. uk- ‘float (i)’ 
b. uk-ab- ‘float (i)’ 
c. uk-ab-e- ‘float (t)’ 


In these last two cases, the policy of pairwise listing results in one stem of each isoradical 
set (specifically, 3b and 4a) being left out of the database. 

In fairness to Jacobsen, it must be noted that morphological analysis was not his aim 
in compiling his appendix. Most crucially for our purposes, he nowhere refers to the 
notion “root”, and it is only with Volpe’s (2005) DM treatment that the root becomes a 
central concept in the interpretation of the appendix data. Volpe’s (2005: 121 (note 27)) 
procedure for root extraction, however, amounts to simply peeling off the outermost 
derivational suffix and labeling the residue a root, and he has been followed implicitly 
in this practice by other DM theorists. 

We should observe before proceeding that there are many cases, illustrated by (5) 
below, in which Volpe’s procedure does in fact yield a root. 


(5) a. nao-r- ‘get better (illness, injury); get repaired’ 


b. nao-s- ‘cure; repair’ 


(5) is clearly the kind of case Marantz (2013: 106) has in mind when he says about Japanese 
that “there seems overwhelming support for analyzing the suffixes signaling either the 
lexical causative as opposed to the inchoative or the inchoative as opposed to the lexical 
causative as realizations of a little v head attaching to the root.” As we will now see, 
however, there are a number of respects in which the properties of (5) do not generalize 
to the Japanese derivational system as a whole. Most crucially, there is reliable evidence 
for a number of Volpe’s “roots” that they are actually morphologically complex, with 
the result that many verb stems contain two derivational suffixes rather than one. Given 
that, as we have already noted, Volpe’s procedure for root extraction involves no attempt 
to compare verb stems with stems of other lexical classes or with verb stems outside the 
transitivity pair under consideration, this result is unsurprising. Let us examine a few 
representative cases. 

Consider the sequence tunag- of (3) above. Comparison of that sequence, roughly 
meaning ‘connect’, with the noun tuna ‘rope’ suggests that the former is underseg- 
mented, and in particular that the transitive stem tunag- consists of the noun stem tuna 
(or the root that underlies it) suffixed with -g-. This suggestion is confirmed when we 
observe that -g- is suffixal in a number of other stems as well, with a core subset ((6-7 
below and the three of note 3) displaying a very specific semantics: -g- takes as input a 
noun stem denoting a tool T and returns a verb stem with the meaning “to make typical 
use of T”. Three examples that occasion resegmentation of entries of the Jacobsen/Volpe 
appendix are given in (6) through (8), with both a transitive and an intransitive stem 
noted in each case.* 


4 Three further examples whose status in the contemporary language might be thought questionable are 
tumu-g- ‘spin (thread)' (tumu ‘spindle’), ha-g- ‘fletch (arrow)' (ha ‘feather’), and, with an irregular alterna- 
tion of t with s, husa-g- ‘cover, stop up’ (huta cover"), 


138 


7 Root-based syntax and Japanese derivational morphology 


(6) a. tuna ‘rope’ 
b. tuna-g- ‘tie together, tie up’ 
c. tuna-g-ar- ‘get connected? 
(7) a. to(-isi) ‘whetstone’ 
b. to-g- ‘whet’ 
c. to-g-ar- ‘become pointed’ 
(8) a. mata ‘crotch, fork’ 
b. mata-g- ‘step over, straddle (t)’ 


c. mata-g-ar- ‘straddle (i)’ 


The derivational relationships postulated in (6-8) appear unimpeachable in both formal 
and semantic terms: the roots are nonalternating, and the semantic relationship between 
nominal and verbal meanings is unmistakable. 

More common as a stem-forming suffix than -g- is -m-, which can be shown to be a 
stem formant in several dozen verbs. (9-11) display three cases in which recognition of 
suffixal -m- forces resegmentation of strings that Volpe takes to be roots (the (a) items 
of (9) and (10) are adjective stems, and that of (11) is an adjectival noun, a stem with 
adjectival meaning but essentially nominal inflection). 


(9) a. ita- ‘painful’ 
b. ita-m- ‘be painful, get injured’ 
c. ita-m-e- ‘injure’ 

(10) a. yuru- ‘slack’ 
b. yuru-m- ‘slacken (i)’ 
c. yuru-m-e- ‘slacken (t) 

(11) a. hiso-ka ‘stealthy, secret’ 
b. hiso-m- ‘be hidden, lurk’ 


c. hiso-m-e- “conceal, mask’ 


We have seen that in addition to verb stems formed with the common suffixes -r- and 
-s-, illustrated in (5), there are verb stems formed with -g- and -m-. In fact, of the nine 
occurring stem-final consonants, all but n can be shown to be suffixal in some stems. 
Suffixal -b- has been illustrated in (4b) above; (12) through (14) display one example each 
for -k-, -t-, and -w- (w deletes in the phrasal phonology before nonlow vowels; here and 
below, I take reference to a suffix -C(V)- to subsume reference to its post-consonantal 
allomorph -aC(V)-). 


5 Kunio Nishiyama (personal communication) suggests the possibility that -g- in (6) is a (transitivity-neutral) 
verbalizer, with the transitivity of (6b) resulting from a null transitivizer parallel to the intransitive -ar- of 
(6c). A fully general form of this proposal will require the postulation of a very large number of morpho- 
logical zeros. 
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(12) 


na-k- ‘make characteristic sound’ (animal); ‘weep’ (human) 


Ge 


na-r- ‘sound (i)’ (inanimate subject) 
c. na-r-as- ‘sound (t) 
(13) a. hana-re- ‘move (i) away (from); be released’ 


b. hana-s- ‘move (t) away (from); release’ 


e 


hana-t- ‘release forcefully, discharge’ 


(14) a. muk- ‘face, look (in a direction)’ 


T 


muk-e- ‘cause to face, turn (t) (in a direction)’ 


B 


muk-aw- ‘face, proceed toward’ 


muk-aw-e- ‘(go to) meet, receive (a visitor)’ 


We see, then, that the inventory of suffixes that create verb stems of determinate tran- 
sitivity is a good deal larger than envisioned in the Jacobsen/Volpe appendix, where, 
apart from idiosyncratic formations, the relevant set is essentially limited to -r-, -s-, -re-, 
-se-, -e-, -i-, and zero. In closing this introductory section, let us consider two semantic 
issues that arise with respect to the Jacobsen/Volpe appendix data. The first involves the 
interpretation of roots, the second the interpretation of suffixes. 

Quite apart from the question of whether or not roots are taken to be elements that 
are manipulated by the syntax, no attempt to segment stems into roots and suffixes syn- 
chronically is a fully grounded project in the absence of a criterion for isoradicality — 
a criterion, that is, for determining when two given stems share a root and when they 
do not. The semantic lability of individual stems over time that will be illustrated in $5 
makes this by no means an idle question. It is, however, a question that neither Jacobsen 
nor Volpe engage with seriously; Jacobsen (1982: 38)° says only that the members of a 
transitivity pair must exhibit “a certain degree of semantic affinity", and Volpe (2005: 
32) confines himself to observing that “Root semantics is a wide-open area for further 
research". The question of isoradicality is essentially coextensive with the traditional 
problem of distinguishing homophony from polysemy, a problem that may ultimately 
be illuminated by psycholinguistic and neurolinguistic research (see Marantz 2013: 103). 
It is worth keeping in mind, however, that any program involving the synchronic iden- 
tification of roots requires innumerable provisional decisions on this matter. 

Turning now to the interpretation of the stem-forming suffixes of which we have seen 
a number of examples, let us note first that while Volpe (2005) follows Jacobsen (1982; 
1992) in referring to the two members of a transitivity pair as "intransitive" and "transi- 
tive", more recent literature such as Harley (2008; 2012) and Marantz (2013) use the more 
specific “inchoative” and “causative”. In fact, cases like ka-r- (Western Japan; cf. Eastern 
ka-ri-) ‘borrow’ versus ka-s- ‘lend’ and azuk-ar- ‘take on deposit’ versus azuk-e- ‘deposit’ 
show that even the former pair of terms is too specific to be accurate in general. This is be- 
cause the first member of each of those pairs shows "intransitive" morphology in spite of 
displaying what, under Burzio's generalization, are the twin hallmarks of causative little 


$ See also note 5, p.34 and the corresponding note 30 of Jacobsen 1992 (pp. 248-249). 


140 


7 Root-based syntax and Japanese derivational morphology 


v, namely an agentive external argument and accusative case-marking. Cross-linguistic 
parallels’ suggest that the treatment of ‘borrow’ as the intransitive counterpart of ‘lend’ 
is by no means accidental or exceptional. The phenomenon of a stem with causative 
meaning but “intransitive” morphology appears to show that if the semantics of the two 
morphological types are specified separately, they will have to overlap. Let us briefly 
note another type of example that suggests the same conclusion. 

The stems too-r- ‘pass through’ and mata-g- ‘step over, pass over, straddle’ (8b above) 
are closely parallel in both their semantics and their case-marking. When the subject 
is animate, as in (15) (where stem-internal segmentation is suppressed), that subject 
(marked nominative but omitted in the examples) is both an agent and a theme mov- 
ing along a path, and the accusative object is an intermediate point on that path. 


(15) a. Syootengai o toot-te eki ni modot-ta. 
shopping.district Acc pass.through-cj station DAT return-PF 


‘I passed through the shopping district and returned to the station’ 


b. Saku o  matai-de  hodoo ni hait-ta. 
barrier Acc step.over-cj sidewalk DAT enter-PF 


‘I stepped over the barrier and onto the sidewalk’ 


In other uses, the agent of examples (15) may be replaced by an inanimate theme, with 
matag- in the meaning ‘pass over’, or by a path argument, as in The road passes through 
the tunnel/over the train tracks. 

In spite of the close semantic parallelism between too-r- and mata-g-, however, the 
two stems differ in their transitivity status: too-r- is the intransitive corresponding to 
transitive too-s- ‘pass though (t), while mata-g- is the transitive corresponding to in- 
transitive mata-g-ar- ‘straddle’ (8c above), the latter differing from mata-g- in taking a 
dative rather than an accusative object. Unless too-r- and mata-g- are semantically dis- 
tinct in a way we have failed to identify, this fact shows that the transitivity status of 
a stem cannot be a function of that stem's semantics alone, and a fortiori cannot be a 
function of the semantics of that stem's suffix. An alternative possibility, which consid- 
erations of space preclude developing here, is that there is a continuum of degrees of 
transitivity, as suggested by Hopper & Thompson (1980) and subsequent work, and that 
what transitivity pairs have in common is that the "transitive" member has a higher de- 
gree of transitivity than the “intransitive” member? In any case, however, the evidence 
we have seen here is sufficient to establish that there is no simple, general account of 
the semantics of the suffixes that create transitivity-specific Japanese verb stems, and 
that, as was the case regarding the question of a criterion for isoradicality, much work 
remains to be done in this area. 

Above, we have seen that the data of the Jacobsen/Volpe appendix is a good deal more 
complex and irregular, both formally and semantically, than consideration of examples 


7 See Kuo (2015: 59, 84-85, 107) for the Taiwanese languages Amis, Puyama, and Seediq, respectively; other 
languages for which the relationship can be easily verified include Tagalog and Swahili. 

8 Jacobsen (1992: 73-74) develops a scalar concept of transitivity but does not suggest that the common point 
of transitivity pairs is a transitivity differential in favor of the morphologically transitive member. 
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like (5) might suggest. Nothing in the present section, however, is intended as an argu- 
ment for or against any particular treatment of that data. Taking our discussion of the 
Jacobsen/Volpe appendix as a starting point, we now turn, in Sections 3 through 5, to 
arguments against proposals to generate Japanese verb stems syntactically. 


3 Sequences of verbal suffixes 


As we have already noted, one consequence of the resegmentations that are entailed 
by comparing the stems that participate in transitivity pairs with stems of other lexical 
categories (as well as with other verb stems) is that many stems can be seen to display a 
sequence of two suffixes attached successively to a root rather than a single transitivity- 
determining suffix. For example, the (c) examples of (6) through (8) above all involve 
the sequence -g-ar-, where the first suffix creates a transitive stem and the second an 
intransitive. Similarly, the (c) examples of (9) through (11) all involve -m-e-, where the 
first suffix creates an intransitive stem and the second a transitive. Suffix sequences are 
also observed in (12c) and (14d). 

Sequences of two transitivizing suffixes and two intransitivizing suffixes are observed 
as well. For example, (16d) below, where (16) is an expansion of (6), involves the sequence 
-g-e-, where both suffixes create transitive stems, and (17c) involves the sequence -m-ar-, 
where both suffixes create intransitive stems. 


(16) a. tuna ‘rope’ 

b. tuna-g- ‘tie together, tie up’ 

c. tuna-g-ar- ‘get connected’ 

d. tuna-g-e- ‘tie together, connect’ 
(17 a. yasu-raka ‘peaceful, calm’ 


T 


yasu-m- ‘rest (i)’ 


yasu-m-ar- ‘become rested, at ease’ 


Ro 


. yasu-m-e- ‘rest (t)’ 


Recall now the DM claim that Japanese transitivity-determining suffixes are instances 
of little v, with at least an inchoative and a causative "flavor" (Marantz 2013: 107) to be 
distinguished. Abstracting away from the fact that (at a minimum) both types of little v 
will have to be polysemous, and writing the inchoative version as “v,” and the causative 
version as “v”, the structure of the two stems of (5), for example, will be as shown in 
(18) (simplified glosses given) . 


(18) a. nao-r- [[R]vi] ‘get better’ 
b. nao-s- [[R]v.] ‘make better’ 


In the same way, the structure of the stems (16c-16d) will be as in (19), and that of the 
stems (17c-17d) will be as in (20). (Here and below, I take the fact that -g- and -m- (and 
also -b-, -k-, -t-, -w-) in isolation are entirely parallel in function to the suffixes the DM 
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literature treats as little v (notably -r-, -s-, and -e- (see e.g. Marantz 2013: 108) to license 
a parallel treatment for them in the DM framework we are taking as representative of 
syntactic treatments of derivation.) 


(19) a. tuna-g-ar- [[[R]vc]vi] ‘connect (iy 
b. tuna-g-e- [[[R]ve]ve] ‘connect (ty 
(20) a. yasu-m-ar- [[[R]vi]vi] ‘get rested’ 


b. yasu-m-e- [[[R]vi]vc] ‘rest (ty 


If the representations of (19-20) are constructed in the syntax, in line with the proposal 
that roots and functional morphemes are the primitives of syntactic derivation, we will 
expect them to be interpreted compositionally, with the meaning of the outer little v 
combining with the result of composing the meaning of the inner little v with that of 
the root. In fact, no verb stem has an interpretation that involves two units of “little v 
meaning”, either two instances of “inchoative” or two instances of “causative” or one of 
each; for interpretive purposes, the only little v that matters in representations like those 
of (19-20) is the outer one.’ This is as if, when the Perfect auxiliary occurs outside of the 
Progressive in English or the Passive outside of the (productive) Causative in Japanese, 
as illustrated in (21), the outer auxiliary were to nullify the interpretation of the inner 
one rather than composing with it semantically. 


(21) a. have been eating [PERF[PROG[V]]] 
b. tabe-sase-rare- [[[V]CAUS]PASS] ‘be made to eat’ 


It would seem that in uncontroversially syntactic constructions like those of (21), this 
kind of nullification never occurs, and thus that we can assume that the syntactic com- 
putational system includes no mechanism for opting out of compositional interpretation 
in this way. The structures of (19-20) therefore pose a major problem for the idea that 
the suffixes deriving Japanese verb stems are syntactic elements. 

We have seen that the syntactic status of constructions like (19-20) is called into ques- 
tion by their interpretive properties. The representations of (19) pose a second problem 
as well, namely that the internal v. will introduce an external argument that must ulti- 
mately remain unrealized.!° In the remainder of this section, I concentrate on document- 
ing further instances of the construction (19a), verb stems that introduce no external 
argument in spite of containing a transitivizing suffix. 


? While the v; of (20b) could be taken to be semantically active, the meaning of such causatives would have 
to coincide with that of causatives derived from roots, as in (18b). The semantic inertness of the inner little 
v thus follows for this case as for the others. (In DM, identification of category-determining elements with 
phase heads requires that lexical causatives, being monophasal, be root-based (Marantz 2007).) 

10 The causative interpretation and the external argument may in fact be introduced by separate heads 
(Pylkkänen 2008: chapter 3); what is important for our purposes is that in the data at hand they are both 
present when a transitivizing suffix appears alone but absent when it appears inside another transitivity- 
determining suffix. 


143 


Brent de Chene 


Consider first the isoradical sets (22-25), all of which illustrate the suffix sequence 


-r-e- 


(22) a. mak- ‘roll up, wind around’ 

b. maku-r- ‘roll up, tuck up’ 

c. maku-r-e- ‘get turned up, ride up’ 
(23) a nezi ‘screw’ 


nezi-r- ‘twist’ 
c. nezi-r-e- ‘get twisted’ 
(24) a. yabu-k- ‘rip (ty 
b. yabu-r- ‘rip (ty 


c. yabu-r-e- ‘rip (i) 
(25) a. kasu-ka ‘faint, at the limits of perception’ 
b. kasu-m- ‘become hazy, dim’ 


c. kasu-m-e- ‘cloud (the vision of), deceive; graze, skim over; skim off, steal’ 
kasu-r- ‘graze (touch lightly in passing)’ 

e. kasu-r-e- ‘become faint or discontinuous (printing, writing); become hoarse 
(voice)’ 


The stems of (22-25) are all in common use in contemporary Japanese; a final parallel set 
that is particularly transparent semantically but for which the verb stems are obsolete 
is kubi ‘neck’, kubi-r- ‘strangle’, kubi-r-e- ‘die by hanging oneself’. 

Examples of the construction (19a) involving the suffix sequence -m-ar- can also be 
cited, as in (26-28). (26a) reflects the fact, not previously exemplified, that bare roots not 
infrequently occur reduplicated as adverbial items of the mimetic vocabulary. 


(26) a. kurukuru ‘round and round (rotation, winding)’ 
b. kur- reel in, wind’ 
c. kuru-m- ‘wrap by rolling’ 
d. kuru-m-ar- ‘be rolled up, wrapped up’ 
e. kuru-m-e- ‘lump together’ 
(27) a. tuka ‘hilt, handle’ 
b. tuka-m- ‘grasp’ (accusative object) 
c. tuka-m-ar- ‘be caught, captured’; ‘hold on to’ (dative object) 


tuka-m-aw-e- ‘catch, capture’ 


H Taking the root to be maku- in (22) obviates postulating a new suffix allomorph for the (b) and (c) examples 
but requires a rule deleting a root-final vowel in a zero-derived verb stem for (22a). Given also a rule a 
+ i > e, mirroring the presumed historical development (see Ono 1953 and subsequent literature), many 
apparently consonant-final roots could be reanalyzed along parallel lines; for example, the stems of (1-2) 
above could be tok-, toka-i-, toka-s- (Vtoka) rather than tok-, tok-e-, tok-as- (Vtok). 
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(28) a. haza-ma ‘gap, interstice' (< hasa-ma (ma ‘interval’)) 
b. hasa-m- ‘insert between’ 


c. hasa-m-ar- ‘get caught between’ 


In (6-8) and (22-28), then, we have seen examples in which intransitivizing suffixes 
appear outside transitivizing suffixes, resulting in stems of the shape (19a). These are 
structures for which, as a result of the internal v,, both a causative interpretation and an 
external argument are predicted, but do not materialize. We have already argued that 
the syntactic status of all four constructions (19-20) is called into question by the fact 
that the inner little v of those constructions is never interpreted. Regarding the unre- 
alized external argument of stems of the shape (19a), similarly, it is clear that there is 
no way, in a system of syntactic derivation based on selectional features and the Merge 
operation and restricted by a “no tampering” condition (Chomsky 2008: 138), for a spec- 
ifier introduced by one head to be deleted or ignored as a consequence of merger of a 
higher head. The conclusion seems inescapable, then, that a system of stem-formation 
that allows stems of the form (19a), and stems of the form (19-20) more generally, cannot 
be the result of the syntactic computational system. 


4 Verbal -m- and adjectival -si- 


In (19-20) above, we saw that transitivizing and intransitivizing suffixes, characterized 
as v, and v; respectively, can occur in any of the four logically possible orders following 
a root. We have not seen any examples, however, in which the members of an individual 
pair of suffixes appear in a given order after one set of roots but in the opposite order 
after another set. For example, the suffixes of the sequence -g-e- always occur in that 
order regardless of their status as transitivizing or intransitivizing. In fact, there are 
three possibilities in that regard: both suffixes can be transitivizing, as in (16d), the first 
can be intransitivizing and the second transitivizing, as in yawa-ra-g-e- ‘soften (t)’ (cf. 
yawa-ra-g- ‘soften (i)’), or the first can be transitivizing and the second intransitivizing, 
as in hisya-g-e- ~ hisi-g-e- ‘be crushed’ (cf. hisya-g- ~ hisi-g- ‘crush’). In this section 
we will observe two suffixes,one deriving verb stems and the other adjective stems, for 
which there are four modes of attachment to a root: direct affixation of each suffix, verbal 
suffix preceding adjectival, adjectival suffix preceding verbal, and both orders with the 
same root. It will be argued that both the fact that only the outer suffix is interpreted, 
parallel with what we saw in §3, and the fact that the relative position of the suffixes is 
an idiosyncratic function of the individual root militate against treating the suffixes as 
syntactic elements. 

Many Japanese roots support both a verb stem in -m-, exemplified in §3, and an ad- 
jective stem formed with the suffix -si-. While adjective stems in -si- are not treated 
in the DM literature on Japanese derivation, that suffix has a natural DM analysis as a 
category-determining little a, where the latter is a stative counterpart of inchoative vj 
and causative v. (Marantz 2013: 103). In the examples of (29-30), both suffixes attach 
directly to a root, making those examples parallel, as the displayed structure shows, to 
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the verb stems nao-r- and nao-s- that we saw in (5) and (18) (the root of 30 also supports a 
stem kuy-i- that is a close synonym of (30b); y deletes before a front vowel in the phrasal 


phonology). 


(29) a. suzu-si- [[R]a] ‘cool, refreshing’ 
b. suzu-m- [[R]vi] ‘cool off, refresh oneself’ 
(30) a. kuy-asi- [[R]a] ‘causing chagrin, regret’ 


b. kuy-am- [[R]v.] ‘rue, regret’ 


There are a number of roots supporting both types of stem seen in (29-30), however, for 
which the verb stem in -m- is derived from the adjective stem in -si-. This is illustrated in 
(31-32) (I take -si- to be suffixal in an otherwise unsegmentable CVCVsi- adjective stem). 


(31) a. kuru-si- [[R]a] ‘painful, uncomfortable, difficult’ 
b. kuru-si-m- [[[R]a]vi] ‘suffer’ 
(32) a. kana-si- [[R]a] ‘sad’ 


b. kana-si-m- [[[R]a]vi] ‘grieve, sorrow’ 


And there are roots for which, in contrast, the verb stem in -m-, whether transitive (as 


in 33b) or intransitive (as in 34b) serves as the base for derivation of the adjective stem 


in -si-:” 


~ 
Lu 
Sy 

— 
t 


uto- [[R]a] ‘distant, ill-informed’ 

uto-m- [[R]v.] ‘shun, ostracize’ 

c. uto-m-asi- [[[R]vc]a] ‘unpleasant, repugnant’ 
ita- [[R]a] ‘painful’ 

ita-m- [[R] vi] ‘be painful; get damaged’ 


(34) 


D 


c. ita-m-asi- [[[R]v;]a] ‘pitiable, pathetic’ 


Finally, there is at least one root for which both the verb stem in -m- and the adjective 
stem in -si- contain both suffixes, in the opposite order in the two cases: 


(35) a. tutu-m-asi- [[[R]vc]a] ‘modest, unpretentious’ 


b. tutu-si-m- [[[R]a]vc] ‘be cautious regarding; abstain from’ 


What conclusions can we draw from the data of (29-35)? First of all, with regard to 
interpretation, those examples support the same observation that was made in 83 for 
stems of the four types in (19-20), namely that when a stem contains two derivational 
suffixes, the inner one is interpretively inert.? The semantic relations of the two stems 


12 For an English parallel to the three types (29-30), (31-32), (33-34), consider ambigu-ous/ity, duplic-it-ous, 
monstr-os-ity. 

13 While one might imagine for some of the doubly suffixed stems of (31-35) that the interpretation of the 
whole depends in some way on that of the inner suffix, there is evidence against this idea in some cases. 
With respect to (34), for example, the root-reduplicated adjective itaita-si- ‘pitiable, pathetic’ shows that 
the occurrence of that meaning for the stem ita-m-asi- has nothing to do with the inner suffix -m-. 
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to each other and to the root in (35), for example, are roughly the same as in (29-30), 
even though the stems of (35) each contain two suffixes and the stems of (29-30) only 
one. This observation, as we have seen, casts doubt on the proposal that the suffixes in 
question are syntactic elements. 

A parallel argument can be made regarding the relative position of suffixes. (19-20) 
have already shown, of course, that if suffixes are divided into transitivizing (“causative”) 
and intransitivizing (“inchoative”) types, there are no constraints on their relative order 
when two of them occur in the same stem, so that their actual order in particular cases 
becomes a function of the individual root. As suggested by the discussion of the suffix 
sequence -g-e- at the beginning of this section, though, if we classify suffixes on strictly 
distributional grounds, without reference to transitivity value, it is possible to set up 
two position classes that will obviate conditioning of suffix order by roots in the great 
majority of cases: roughly speaking, the suffixes recognized by the Jacobsen/Volpe seg- 
mentation of stems will belong to the outer layer, with the inner layer being composed 
of suffixes such as -g-, -m-, -w-, and (transitivity-neutral) -r-. 

For the data of (29-35), however, conditioning of suffix order by individual roots is 
inescapable. This, then, constitutes a second way, independent of the interpretive inert- 
ness of the inner suffix, in which the behavior of -m- and -si- fails to conform to what we 
would expect of syntactic elements. Returning to the analogy with auxiliary verbs that 
we appealed to in $3 (see 21 above), the positional relations of those two suffixes are as if 
the Perfect and the Progressive auxiliaries (say) both appeared adjacent to the stem for 
one class of verbs, but the Perfect was formed by placing the Perfect auxiliary outside 
the Progressive for a second class of verbs, and the Progressive was formed by placing 
the Progressive auxiliary outside the Perfect for a third class. The reason, of course, that 
this is difficult to imagine is that we expect unambiguously syntactic elements to appear 
in a fixed order with respect to a verbal or nominal stem. Indeed, since the 1990s, a great 
deal of work in cartographic syntax (notably Cinque 1999) has developed the idea that 
the (hierarchical) ordering of syntactic functional heads is fixed not only internally to a 
single language, but universally. From that perspective, the radical failure of Japanese 
verbal -m- and adjectival -si- to display a consistent ordering makes it extremely difficult 
to view them as syntactic heads. 


5 Compositional meanings and semantic change 


We have claimed that the syntactic computational system includes no mechanism for 
opting out of compositional interpretation, in particular by allowing a higher head to 
nullify the interpretation of a lower one. More generally, it seems reasonable to assume 
that the compositional interpretation of structures generated by the syntax is automatic, 
so that there is no way to block the compositional interpretation of a syntactic con- 
stituent.^ We expect it to be true, in other words, that no instance of a syntactically gen- 
erated structure or construction can idiosyncratically fail to display the compositional 


14 T will assume that this principle is not compromised by the delayed transfer to the interfaces characteristic 
of phase-based derivation (Chomsky 2001). 
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semantic interpretation associated with that structure or construction.” As a result, a 
phrase like kick the bucket that is demonstrably generated by the syntax will automati- 
cally have the compositional interpretation predicted by its lexical items and its syntactic 
structure, independently of whether it has one or more listed interpretations as well. As 
a diachronic corollary, we can infer that loss of the compositional interpretation of a 
syntactically generated constituent is not a possible change, assuming that the grammar 
and the lexicon have remained stable in the relevant respects. Thus, it would not be pos- 
sible for kick the bucket to lose its compositional interpretation over time, retaining only 
the idiomatic one. When a phrase that was once generated by the syntax does have only 
a listed interpretation, it is either because the component words have dropped out of the 
lexicon, as is probably the case for the phrase to plight one’s troth for most contempo- 
rary English speakers, or because the grammar no longer generates phrases of the type 
in question, as is the case for the phrase till death do us part. 

What is true for manifestly phrasal constituents is true for inflected forms as well. Lex- 
icalization (i.e. idiomatization) of guts in the meaning ‘courage’ and balls in the meaning 
‘audacity’ has no effect on the status of those forms as regular plurals as long as the rel- 
evant stems and the rules for forming and interpreting plurals are diachronically stable. 
In Japanese, many verbal Gerund forms in -te are lexicalized as adverbs: sitagatte, yotte 
‘consequently’ (sitagaw- ‘obey’, yor- ‘be due to’), kiwamete, itatte ‘extremely’ (kiwame- 
‘reach, carry to extremity’, itar- ‘reach’). As long as the relevant verb stems remain in 
the lexicon and -te remains an inflectional suffix, however, there is no way that these id- 
iomatic meanings can replace the compositional meanings that the forms have by virtue 
of their inflectional (ultimately, syntactic) status. The same is true of verbal Conjunctive 
forms that have been lexicalized as nouns: nagasi ‘sink’ (naga-s- ‘make flow’), nagare 
‘flow, course of events’ (naga-re- "How US 

If loss of a compositional interpretation is not a possible semantic change, assuming 
stability of grammar and lexicon, then demonstrating that the predicted compositional 
meaning of a putatively syntactic construction is subject to loss over time will support 
the conclusion that the construction in question is not syntactic after all, since if it were, 
its compositional meaning should be diachronically stable. In the present section, I will 
make this argument with respect to the Japanese lexical causative in -s-, exemplified by 
stems like nao-s- ‘cure, repair’, seen in (5b) and (18b) above. Specifically, I will document a 
number of cases in which the construction [R[s]] can be shown to have had the predicted 
interpretation CAUS(IRI) (IRI the interpretation of R) originally but later to have lost that 
interpretation in spite of the fact that IRI itself has remained constant. 


15 Correspondingly, establishing that some phrase P is a counterexample to this principle will require (a) 
displaying P’s syntactic structure; (b) displaying the rule of interpretation associated with that structure; 
and (c) showing that P idiosyncratically lacks the predicted interpretation. 

16 The semantics of these nouns has been treated in the DM literature since Volpe (2005) as involving selection 
of root allosemes by a noun-forming suffix (“special meanings of the root triggered across the little v 
head” (Marantz 2013: 107). The extreme semantic distance that separates many of the nouns from their 
corresponding roots (abundantly documented by Volpe), however, makes idiom-formation a more plausible 
basis for the nominal meanings than alloseme choice (for the distinction between the two mechanisms, see 
Marantz 2013: 105). 


148 


7 Root-based syntax and Japanese derivational morphology 


As a first example, consider the stem yurus- ‘allow, forgive’. In Old Japanese (see 
Omodaka et al. 1967), the primary meaning of this stem is ‘slacken (t), with secondary 
meanings ‘let go of’; ‘allow, comply with, tolerate’; and ‘forgive, exempt’. Yurus-, in other 
words, is historically the causative in -s- on Vyuru ‘slack’ (see 30) above), a root that in 
modern Japanese underlies the adjective stem yuru- ‘slack’, the nominal adjective yuru- 
yaka ‘slack, gradual’, and the verb stems yuru-m- ‘slacken (i)’ and yuru-m-e- ‘slacken 
(t). As is clear from these four stems, the root has been completely stable semantically 
over thirteen centuries, and the same can be assumed for causative -s-. There is no trace 
in the modern meaning of yurus-, however, of the original concrete primary meaning 
'slacken'. That meaning, in other words, has been completely replaced by the originally 
secondary or extended meanings ‘allow’ and ‘forgive’. If yuru-s- had been a syntactic 
construction, with the meaning ‘slacken (t)’ the compositional result of a semantic rule of 
interpretation, this replacement should have been impossible, just as we have suggested 
that it would be impossible for kick the bucket to lose its compositional meaning and 
retain only the idiomatic one. 

The history of the stem itas- ‘do (humble)' is broadly parallel. In Old Japanese, it is 
the causative corresponding to itar- ‘reach a limit’, as explicitly noted in Omodaka et 
al. (1967), and thus means ‘bring to a limit’. In the modern language, while intransitive 
itar- has retained its original meaning, itas- is for the most part, bleached of concrete 
content, simply a suppletive humble variant of suru ‘do’. A third case in which a s-stem 
has lost a putatively compositional causative meaning involves konas- 'deal with, take 
care of; be skilled at’, whose primary meaning was originally ‘break up, pulverize’ and 
which is based historically on ko ‘powder’ (Ono, Satake & Maeda 1974). Like many other 
original monosyllables, ko has been replaced as a freestanding noun by a bisyllabic form, 
in this case kona, which is attested starting around 1700. The only serious proposal for 
the origin of kona (see NKD) appears to be that it is a backformation based on konas-. If 
the backformation theory is correct, kona and konas- were unquestionably isoradical at 
the relevant point in time, so that konas- consisted of J/kona ‘powder’ plus causative -s-. 
Today, however, while the root noun remains in the language, the meaning ‘break up, 
pulverize’ for the verb is extinct." 

Two further stems in -s- for which the predicted causative meaning appears to have 
been lost over time are hatas- ‘carry out, perform, accomplish’ and kuras- ‘make a liv- 
ing; live, spend (time)’. The roots appear in the zero-derived noun hata ‘edge, perimeter; 
outside’ and the zero-derived adjective stem kura- ‘dark’, respectively, and are semanti- 
cally identifiable in the intransitives hate- ‘end OU and kure- ‘darken (day), end DI (for 
the a ~ e alternation, see note 11 above). The expected primary meaning ‘end (t)’ of 
hatas- appears in the gloss ‘bring to a conclusion’ in Omodaka et al. (1967); for kuras-, 
similarly, Omodaka et al. record the expected primary meaning ‘spend the time until 
evening’ (i.e. ‘let the day darken’). In both cases, however, this compositional meaning 
is absent from the modern stems, neither of which stands in a purely causative relation 
to the corresponding intransitive or to the root. The meaning of hatas-, as the above 


17 While dictionaries retain examples like tuti o konasu ‘break up dirt (clods)’, the speakers I have consulted 
deny knowledge of such a usage. 
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definition indicates, inherently includes an element of purposive activity (carrying out 
a command, achieving a goal, fulfilling an obligation) that is absent from that of hate-. 
While the semantic difference between kuras- and kure- is more subtle, the basic fact 
preventing the former from functioning as the causative of the latter is that, unlike kure- 
(‘come to an end’), kuras- (‘spend (time)’) is atelic. Both hatas- and kuras-, then, like 
yurus-, itas-, and konas-, are cases in which the predicted interpretation CAUS(IRI) of 
the construction [R[s]] has been lost over time. 

In this section, we have seen an argument against the syntactic derivation of Japanese 
verb stems based on semantic change, using causatives in -s- as a representative stem- 
type. It goes without saying, we should emphasize, that perhaps the most common type 
of semantic change, the addition of idiomatic or extended meanings, does not count 
against the hypothesis of syntactic generation: as is well known, linguistic units of any 
size can be idiomatized, with the tendency to undergo idiomatization inversely propor- 
tional, roughly speaking, to size (Di Sciullo & Williams 1987: 14). But loss of a putatively 
compositional meaning, we have claimed, does count against syntactic generation, be- 
cause there is no reason to take the compositional interpretation of syntactic structure 
to be anything but automatic and exceptionless. In order for a compositional meaning 
M to be lost, the syntactic structure underlying it would first have to be exempted from 
compositional interpretation, with M being lexicalized at the same time; M could then 
be lost from the lexicon. If this sequence of events is impossible because exemptions of 
the required type are never granted, however, a putatively compositional meaning that 
is in fact subject to loss cannot have been based on a syntactic derivation in the first 
place. 


6 Conclusion 


Above, I have attempted to evaluate the proposal that the derivational suffixes that cre- 
ate transitive and intransitive verb stems in Japanese are syntactic heads, in particular 
varieties of little v. Crucial evidence in this regard has come from identifying an inner 
layer of derivational suffixation (-g-, -m-, etc.) in addition to the well-known outer layer 
whose main members are -r-, -s-, -re-, -se-, -e-, -i-, and zero, since this has allowed us to 
raise the question of how two derivational suffixes interact when they occur together in 
the same stem. We saw in $3 that in such a case, the inner suffix is always inert for pur- 
poses of argument structure and semantic interpretation, casting doubt on the position 
that the suffixes are syntactic elements. In $4, we saw that the same is true for combina- 
tions of the verbal suffix -m- and the adjectival suffix -si-, with the added complication 
that the order in which those two suffixes occur is an idiosyncratic function of the root. 
Finally, in $5, we argued, without reference to suffix sequences, that the combination 
of a root and a transitivity-determining suffix, taking causative -s- as a representative 
example, cannot be a syntactic construction because its putatively compositional inter- 
pretation is unstable over time. All the evidence we have seen, then, points toward the 
conclusion that the derivational suffixes under consideration are not syntactic elements. 
Equivalently, if one wishes in the face of this evidence to generate Japanese verb and ad- 
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jective stems syntactically, one will require relaxation of otherwise well-motivated con- 
straints on structure-building and interpretation precisely for the domain of the stem. 
As suggested at the outset, our conclusions in this regard support Anderson’s (1982: 594) 
position on the place of morphology in the grammar: derivation is pre-syntactic, and the 
units of lexical storage are inflectable stems; inflection, in contrast, is the post-syntactic 
spellout of morphological elements and morphosyntactic properties that are treated by 
syntactic operations. 

The conclusion that Japanese derivational suffixes, in contrast with suffixes like the 
Passive and the productive Causative, are not syntactic elements is supported at a more 
impressionistic level by the fact that, as is easily confirmed, the two sets of suffixes differ 
sharply in their degree of regularity, both formal and semantic. Formally, while varia- 
tion in the shape of the Passive suffix -(r)are- is limited to phonologically conditioned 
alternation of r with zero at the left edge, and variation in the shape of the Causative suf- 
fix -(s)as(e)- is limited to phonologically conditioned alternation of s with zero at the left 
edge and non-phonological alternation of e with zero at the right, variation in the realiza- 
tion of what under a DM analysis will be v; and v, is highly unconstrained, with multiple 
unrelated allomorphs for each of the suffixes and almost complete overlap between the 
two allomorph sets. Semantically, while the meaning of Passivepassive stems in -(r)are- 
and (apart from occasional idioms) Causative stems in -(s)as(e)- is both regular and rel- 
atively straightforward to characterize, the meaning of stems in v; and ve is in most 
cases multiply polysemous and highly idiosyncratic; the glosses we have given above, 
while aiming at a marginal increase in accuracy over the labels in Jacobsen (1992) and 
Volpe (2005), in many cases only scratch the surface of the problem of specifying stem 
meaning. With regard to semantics, it should also be remembered that, as we noted in 
82, morphological analysis internal to the stem proceeds on the basis of an unredeemed 
promissory note regarding the criterion for isoradicality and that equally serious ques- 
tions arise about how the meaning of transitivity suffixes is to be specified, given the 
apparent semantic overlap between transitivizing and intransitivizing morphology. 

If Japanese verb and adjective stems are not, then, created by the syntactic computa- 
tional system, how should we conceive of their structure and, crucially, the knowledge 
that speakers have about that structure? Broadly speaking, there are two types of an- 
swer that could be given to this question. On one of them, derivational morphology of 
the type we have seen here would be the result of a combinatory system roughly parallel 
to syntax but less regular both in terms of the hierarchical relationships holding among 
grammatical elements and the semantic interpretation of complex structures. From the 
standpoint of theoretical parsimony, of course, this would seem like an unattractive pro- 
posal; surely, if possible, we would prefer to maintain that the language faculty involves 
a "single generative engine" (Marantz 2001; 2005). Viewing language as a biological ob- 
ject, however, there would appear to be no grounds for excluding a priori the possibility 
that our linguistic capacities include a combinatory stem-formation module ofthe sort in 
question. In evolutionary terms, such a module might have provided a vastly expanded 
repertory of named concepts in advance of the emergence of a fully regular and produc- 
tive syntax, representing a sort of half-way house on the road to discrete infinity. 
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The second type of answer that could be given to the question of the form taken by 
speaker knowledge of the relations among isoradical stems, assuming that those rela- 
tions are not mediated by the syntactic computational system, is that that knowledge is 
frankly non-generative — that is, non-combinatory. In this case, all stems will be lexically 
listed, with relations among them captured by redundancy rules, for example, those of 
the type pioneered by Jackendoff (1975) (see also Jackendoff 2002: 53). What is unsat- 
isfying about this type of answer is that it provides no insight into why derivational 
morphology should exist at all — why, that is, stems (setting aside compounds) are not 
all atomic. While we have seen evidence that at least some derivational morphology 
cannot be syntactic, then, there is no unambiguously attractive alternative account of 
the structure of speaker knowledge in this area. As a result, the place of derivational 
morphology in our linguistic competence remains very much an open question. 
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Chapter 8 


Morphological complexity and Input 
Optimization 


Michael Hammond 


University of Arizona 


In this paper, we examine morphological complexity through the lens of Input Optimization. 
We take as our starting point the dimensions of complexity proposed in Anderson (2015). 
Input Optimization is a proposal to account for the statistical distribution of phonological 
properties in a constraint-based framework. Here we develop a framework for extending In- 
put Optimization to the morphological domain and then test the morphological dimensions 
Anderson proposes with that framework. 


The dimensions we consider and the framework we develop are both supported by empirical 
tests in English and in Welsh. 


1 Introduction 


Anderson (2015) lays out a number of dimensions of morphological complexity, ways 
that we might evaluate how complex different morphological systems are, e.g. number 
of morphemes in the system, complexity of principles governing combinations of mor- 
phemes, complexity of exponence, complexity of allomorphy, etc. 

These are clearly the right kinds of dimensions for evaluating the complexity of mor- 
phological systems, and we might be inclined to use them as part of a typology of morph- 
ology. However, if we adopt these as our dimensions for calculating complexity, what 
follows? As I learned from Steve Anderson years ago in graduate school, typology with- 
out implications is bad typology. (For discussion, see, for example, Anderson 1999.) 

In this paper, I consider the implications of these dimensions of morphological com- 
plexity for the theory of Input Optimization (Hammond 2013; 2014; 2016). This theory 
develops a notion of phonological complexity which languages “attempt” to minimize 
statistically. To the extent that different phonological representations are complex, they 
are under-represented statistically. This complexity shows up in the markedness of sur- 
face representations and in the complexity of input-output mappings. For example, 
marked segments or syllable structures are under-represented compared with their less 
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marked counterparts. In addition, outputs that are distinct from their inputs are under- 
represented with respect to outputs that are identical with their inputs. 

Interestingly, there are morphological effects as well, effects that sometimes work 
in the opposite direction. For example, initial consonant mutation in Welsh causes a 
mismatch between output and input, but is over-represented. I argue that this is because 
phonological complexity includes morphological mapping. Specifically, to the extent 
that morphological distinctions are not made in the surface form, a representation is 
more complex. This is formalized in OT-based terms using a constraint deriving from 
work by Kurisu (2001). 

This general approach is supported by the facts of haplology, e.g. English adjectives in 
-ly like weekly not getting double-marked with adverbial -ly and plurals like kings not get- 
ting double-marked with genitive -s. The absence of double-marking means that a mor- 
phological distinction is not made on the surface; thus these cases are under-represented 
as expected. 

These morphological cases beg the larger question: should phonological complexity 
be generalized further? Should there be a more general notion of morphological com- 
plexity, built on dimensions of the sort cited above, where forms that are more complex 
morphologically are statistically under-represented? In this paper, I pursue just this 
course, formalizing a notion of morphological complexity and then testing it with cases 
from English and Welsh with respect to the dimensions of morphological complexity 
identified in Anderson (2015). 

The organization of this paper is as follows. I first review some of the dimensions of 
complexity presented in Anderson (2015). I then outline the theory of Input Optimization 
and a framework for a constraint-based theory of morphology that we can assess Input 
Optimization with respect to. With these in hand, we then consider the predictions made 
by the Input Optimization framework and turn to the English and Welsh data. 


2 Dimensions of complexity 


Anderson (2015) discusses a number of dimensions of morphological complexity and we 
will not review them all here. 

We explicitly set aside those systemic dimensions that cannot distinguish options 
within a language. For example, Anderson cites the number of elements in the mor- 
phological system as a measure of its complexity. Thus, for example, if one language has 
one way of marking noun plurals and another has ten ways, we might think of the first 
as less complex. Input Optimization makes no predictions about systemic differences 
like these, as we will see in the next section, so we don’t consider them any further. 

Anderson cites the number of morphemes in a word as a dimension of complexity.! 
This can be taken in several ways. One possibility is that one might compare across 
different languages, which seems to be Anderson’s intent. Another possibility though 


1 This, of course, begs the question of what is a morpheme, an issue at the forefront of much of Steve’s own 
work. 


156 


8 Morphological complexity and Input Optimization 


would be to compare across words in the same language and we investigate this possi- 
bility below. 

Another dimension Anderson identifies is whether the morphemes present in a word 
depend on each other in some way. We might think of this in two ways. Some morpheme 
may only occur when “licensed” by some other. Gender in Spanish is an example of 
this. If gender is marked on a noun, then that gender marking must be present in the 
plural as well, e.g. mes+a+s ‘tables’ table + feminine + plural. Another example might be 
verbal theme vowels in Romance; person/number marking depends on the presence of 
the theme vowel. 

The other side of this coin would be cases where the presence of some morpheme 
blocks another. Haplology is an example of this. For example, adverbial -ly in English 
cannot occur on an adjective that already ends in -ly. Thus we have happily, but not 
* weeklyly. 

Anderson also distinguishes among morpheme types in terms of complexity. Simple 
prefixation or suffixation is less complex than circumfixion or infixation. Presumably, 
non-concatenative morphology like templatic operations, ablaut, umlaut, truncation are 
also more complex. 

Lastly, Anderson cites the complexity of allomorphy as an instance of general mor- 
phological complexity. We take this to mean that a system is more complex when there 
is more allomorphy. We interpret allomorphy as generously as possible to include cases 
where the phonology seems to be involved, say, in the different pronunciations of the 
English plural -s as [s, z, əz], but also plurals that differ on some other basis, e.g. geese, 
criteria, sheep, etc. 

Anderson treats some other possibilities as well, but the ones above are quite simple 
and can be examined within a single language. We list them together below. 


1. Number of morphemes in a word 

2. Principles of morphological combination, e.g. scope, haplology, etc. 
3. Complexity of exponence, e.g. circumfixes, infixes, etc. 

4. Complexity of allomorphy 


In the following section, we review the Input Optimization proposal and sketch out the 
predictions it makes for these. Our interest in Input Optimization is that it provides a 
mechanism by which we can assess the dimensions of morphological complexity we've 
just considered. 


3 Input Optimization 
The problem that Input Optimization addresses is that certain phonological configura- 


tions occur less often than we might otherwise expect. For example, if we look at the 
distribution of stress on two-syllable adjectives, we see that adjectives with final stress 
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like alert [al{t] or opaque [op*ék] are less frequent overall. Strikingly, both are even less 
frequent when they occur prenominally. 

Hammond (2013) argues that this effect is driven by the English Rhythm Rule (Liber- 
man & Prince 1977; Hayes 1984). Certain stress configurations in English are avoided 
by shifting a primary stress leftward onto a preceding secondary. Thus we have alterna- 
tions like Minnesóta vs. Minnesota Mike; thirtéen vs. thirtéen mén; etc. When an adjective 
with final stress occurs in prenominal position, the relevant configuration is quite likely 
to occur. In addition to following context, there is a restriction on preceding context. 
With an adjective like opaque, stress shift leftward is possible because of the preceding 
stress, e.g. ópáque story, but with an adjective like alért, such a shift is impossible and the 
offending configuration must surface, e.g. alért pérson. Both kinds of cases are statisti- 
cally under-represented in English. Specifically, these configurations arise significantly 
less frequently than we might expect based on the overall distribution of adjectives with 
these stress patterns. 

Input Optimization is a proposal to account for statistical skewings like these that 
occur in the phonologies of languages. The idea is developed in Hammond (2013; 2014; 
2016). The basic idea is that markedness and faithfulness violations are avoided in the 
phonology so as to reduce the complexity of the phonological system. Input Optimiza- 
tion is a generalization of the notion of Lexicon Optimization Prince & Smolensky (1993): 


(1) Lexicon Optimization: 
Suppose that several different inputs I;, ,...,[, when parsed by a grammar G 
lead to corresponding outputs O;, O2, . . . , On, all of which are realized as the 
same phonetic form $—these inputs are all phonetically equivalent with respect 
to G. Now one of these outputs must be the most harmonic, by virtue of incurring 
the least significant violation marks: suppose this optimal one is labelled Ox. 
Then the learner should choose, as the underlying form for 6, the input Ix. 


The idea is that if there are multiple ways to produce an output form consistent with the 
facts of a language, the learner chooses the input that produces the fewest constraint 
violations. There are no empirical consequences to Lexicon Optimization by itself. In 
fact, it is defined to apply only when there are no consequences. 

To refine this into something we can use, we define a notion of Phonological complexity 
that applies to individual input-output pairings, but also to entire phonological systems. 
(The basic logic of this is that the complexity of a phonological system is proportional 
to the number of asterisks in its tableaux.) 

We define the output/surface forms of a language as a possibly infinite set of forms. 


(2) O = IO, O2, ..., On,- --} 
Each output form has a corresponding input: 


(3) Izíhb,....I...) 


The phonology is comprised of a finite sequence or vector of constraints: 
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C= (Ci, Co, ..., Cn) 


Any input-output pairing, (I;, O;), then defines a finite vector of violation counts, some 


numb 


(5) 


er of violations for each constraint earned by the winning candidate for that input. 


(nc. He, Hie, 


With these notions, Phonological Complexity (PC) is defined as follows: 


(6) 


Phonological Complexity (PC) 

The phonological complexity of some set of forms is defined as the vector sum of 
the constraint violation vectors for surface forms paired with their respective 
optimal inputs. 


To produce a relative measure of PC given some set of n surface forms, divide the 
PC score for those forms by n. 


Hammond (2016) exemplifies this with a hypothetical example of nasal assimilation. 
Imagine we have the forms in (7) we wish to compute the PC for. Given the inputs pro- 
vided in column 2, we have the constraint violations for winning candidates in columns 3 


and 4. 


(7) 


Input | Output | NC | IO-FarrH 


* 


* 


/anku/ | ag ku 
/in ga/ in ga 
/onke/ | og ke 


* 


a. /onpi/ | ompi 

b. /anba/ | amba T 
c. /unbo/ | umbo S 
d. /endo/ | endo 

e. /onta/ on ta 

f /unti/ un ti 

g. 

h. 

i. 


* 


0 6 


The relative complexity of this first system is: (0,6)/9 = (0,.66). We can compare the 
system in (7) with the one below in (8). Here we have a different array of output forms, 
but the same logic for inputs and constraint violations. 


(8) 


Input | Output | NC | IO-FarrH 
/on pi/ | om pi ii 
Jan bal | am ba 
/en do/ | en do 
/on ta/ on ta 
/un ti/ un ti 

/in di/ in di 
/anku/ | ag ku 
/in ga/ in ga 


* 


TM myo Ao ES 
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The relative complexity of the second system is: (0,4)/8 = (0,0.5), less than the first. 
As argued by Hammond (2016), this notion extends obviously to weighted constraint 
systems. For example, in a system with strict ranking, (0.1, 0.4) is more complex than 
(0, 0.5). 

The proposal then is that all phonological systems are skewed to be less complex. 


(9) Input Optimization 
All else being equal, phonological inputs are selected that minimize the 
phonological complexity of the system. 


Note that (9) alters the frequency of input-output pairings and does not change the 
input-output mapping of any particular form. For example, this principle prefers (8) 
to (7), though both systems contain the same pairings. The difference is in the relative 
frequency of the pairings that occur. 

Our goal in this paper is to see if it is profitable to extend this system to include morph- 
ology. In point of fact, Hammond (2016) addresses this question partially in response to 
statistical effects in Welsh. In particular, Welsh initial consonant mutation is statistically 
over-represented when, based on what we have seen so far, we might have expected the 
opposite. 

Consonant mutation in Welsh refers to a set of phonological changes that apply to 
initial consonants in specific morpho-syntactic contexts. For example, the Soft Mutation 
makes the following changes: 


(10) Orthographic Phonological 
Input Output | Input Output 

p b p b 

t d t d 

c g k g 

b f b M 

d dd d ð 

g 0 g 0 
m f m M 
Il l i l 
rh r I r 


There are many contexts where this occurs, e.g. after certain prepositions, direct object 
of an inflected verb, after certain possessives, feminine singular nouns after the article, 
etc. The following figure gives some examples after the preposition i [i] ‘to’. 


(2) pen [p"en] ‘head’ iben [iben] ‘toahead’ 
cath  [k'a0] ‘cat’ igath [iga0] ‘toacat’ 
mis [mis] ‘month’  ifis [i vis] ‘to a month’ 
nai [naj] ‘nephew? inai [i naj] ‘to a nephew’ 
siop [fop] shop’ isiop [ifəp] ‘toa shop’ 


160 


8 Morphological complexity and Input Optimization 


The chart above also includes examples of non-mutating consonants. Note that words 
with these occur in mutation contexts with no change. 

The Input Optimization framework would seem to predict that mutation should be 
under-represented. After all, mutation entails a faithfulness violation and, all else being 
equal, the system is less complex to the extent that such violations are avoided. This, 
however, is not what occurs. Instead, we get over-representation in mutation contexts. 
Words that begin with consonants that can mutate are over-represented in mutation 
contexts compared with words that begin with consonants that do not mutate. 

To capture this, Hammond (2016) proposes the (revised) Realize Morpheme constraint 
(12). This is a slight revision of a constraint that Kurisu (2001) motivates on other (non- 
statistical) grounds. This constraint basically militates for the expression of morpholog- 
ical information. 


(12) REALIZE MORPHEME (revised) (RM’) 
Let o be a morpheme, B be a morphosyntactic category, and F(a) be the 
phonological form from which F(a+) is used to express a morphosyntactic 
category p. Then RM’ is satisfied with respect to P iff F(a+B) + F(a) 
phonologically. 


With this in hand, the reason why Welsh mutation is over-represented is to reduce 
phonological complexity by minimizing violations of RM’. 

The RM' constraint is also invoked by Hammond (2016) to account for haplology in 
English. We've already cited the fact that forms like * weeklyly are blocked. Similarly, 
we find overt marking of the genitive in English does not occur on plural forms marked 
with -s; the genitive plural of cat is cats’, not something like * catses. Both kinds of cases 
are statistically under-represented in English: they are avoided to minimize violations 
of RM’. 

While RM’ (12) does what’s required, it begs the question of whether a more general 
version of PC is appropriate. In other words, beyond the effects of RM’, do we expect 
Input Optimization to apply to morphology? 


4 Constraint-based morphology 


To assess this, we need a constraint-based theory of morphology. There have been a 
number of proposals over the years for how to deal with morphology generally in an 
OT-like framework. The earliest we know of are Russell (1993; 1995); Hammond (2000), 
but see Aronoff & Xu (2010); Xu & Aronoff (2011) for more recent and fuller proposals. 
A full-on theory of this sort is well beyond the scope of this paper, but let’s lay out what 
such a theory might look like, at least in sufficient detail so we can assess whether it 
makes the right predictions about Input Optimization. 

Let us assume that morphology—like phonology—is a constraint-based system map- 
ping inputs to outputs. Inputs are denuded of any morphological marking, but have suf- 
ficient featural information so that we can evaluate whether morphologically marked 
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candidate forms satisfy relevant constraints. For example, we might imagine that plural 
marking in English comes about by taking a stem marked [+plural] and adding various 
affixes or performing other operations that do or do not express that feature. The idea is 
that the syntax provides a featurally complex object that the morphology can then inter- 
pret. Morphological operations like affixation, reduplication, mutation, etc. add features 
which do or do not match those required by the syntax. Following is a schematic partial 
tableau to give a sense of this. 


- Da 


*pl 


| 
slm 
| 


cat | 


cat -ed 
*pl | ii | *past | 

We would want constraints that force the correct morphological operation to take 
place. Presumably there would be one or more constraints that enforce a correspon- 
dence between the features required by the stem and the features offered by any affixes 
or other changes; to the extent that those don't match, we would have violations. For 
convenience, let's call this FEATURES (Fs). The RM' constraint above, or constraints that 
get the same effects, should fall in this class. 

We also need constraints that militate against gratuitous morphological operations. 
Some of this might be achieved by featural correspondence imposed by Fs, but we surely 
need something to account for the relative markedness of morphological operations gen- 
erally. Perhaps something like this: 


(14) *ABLAUT > *INFIX >> *PREFIX > “SUFFIX 


The basic idea is to posit constraints that militate against any morphological operation. 
These constraints are ranked with respect to each other, presumably in a universal fash- 
ion. This hierarchy would then be interleaved with the Fs constraint. For example, we 
might have: 


(15) *ABLAUT > *INFIX >> FEATURES > *PREFIX >> “SUFFIX 


The effect of such a ranking is that the featural needs of a stem can be met by prefixation 
and suffixation, but not by other operations. 

This system is woefully incomplete and, in its present form, cannot do justice to the 
full range of effects we see in morphological systems. See, for example, Anderson (1982; 
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1992). It is, in some ways, quite similar to these proposals in treating affixation as an in- 
stance of more general morphological operations that interpret syntactically-motivated 
features. However, our goal here is not to develop a full-on constraint-based morpho- 
logical theory. Rather, the point is to build enough of such a theory so that we can test 
Input Optimization with respect to the dimensions of morphological complexity identi- 
fied above. 

Let’s now return to our dimensions and consider one by one what our theoretical 
skeleton in conjunction with Input Optimization predicts. First, we have the number of 
morphemes. All else being equal, the system certainly as developed militates for as few 
morphemes, or other morphological operations, as possible. Additional morphology 
entails violations of the constraints in (14) and Input Optimization predicts these should 
be avoided statistically. 

The second dimension of complexity refers to principles of morphological combina- 
tion. The system we've developed says nothing (so far) about the licensing side of this, 
but it does address morphological haplology. To the extent that haplology occurs, it en- 
tails violations of RM’ (12) and of Fs. Previous work cited above has already established 
that Input Optimization applies in these cases. 

The third dimension is the complexity of exponence, that certain morphological op- 
erations are intrinsically more complex than others. This is captured by the ranking, 
e.g. in (14). We expect morphologies to be statistically skewed against violations of the 
higher-ranked constraints. 

The fourth dimension is complexity of allomorphy, allomorphy that is a consequence 
of phonology or morphophonology like English plural [s, z, az], but also plurals that 
differ on some other basis, e.g. geese, criteria, sheep, etc. The phonological cases fall 
under the core Input Optimization proposal. In fact, Hammond (2013) shows statistical 
skewing for English plural and past allomorphy in just the expected directions. The other 
case cited is also accommodated by the proposal. Internal modifications like geese or 
truncation+suffixation like criteria violate higher-ranked constraints than simple plural 
suffixation; hence they should exhibit under-representation. Similarly, plurals with no 
change like sheep should violate RM’ and Fs and be under-represented. 

Summarizing, a constraint-based morphological theory of the sort sketched out, in 
conjunction with Input Optimization, makes the following predictions: 


(16) 1 Words should have fewer morphemes. 

2 Haplology should be avoided. (This has already been established by Ham- 
mond 2016.) 

3 More marked morphological operations (per the hierarchy above) should be 
avoided. 

4 Morphophonology should be avoided. (This has already been established by 
Hammond 2013.) 

5 Ablaut, umlaut, truncation, etc. should be avoided. (This is essentially the 
same as #3 above.) 

6 Zero-marking should be avoided. 
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We must therefore examine #1, #3/5, and #6 empirically. In the next sections, we look 
at all three cases with data from English and Welsh. 


5 Number of morphemes 


The first prediction of Input Optimization applied to our toy constraint-based theory of 
morphology is that a form is more complex if it has more morphemes. This is a bit tricky 
to test. In many cases, having fewer morphemes is not necessarily the less complex 
option. For example, consider the plural form sheep which lacks an overt plural suffix. 
Is this less complex than a form like cat+s? Probably not. The most reasonable analysis 
given the framework above is that the plural sheep surfaces with an undischarged plural 
feature. On that view, it is not clearly less complex than a form like cat+s. 

We might also think of strong verb forms like spoke, as compared with look+ed. Here, 
however, it would be a mistake to view spoke as having fewer morphemes than look+ed. 
Rather, there is some operation, perhaps mostly lexical, for creating or selecting strong 
verb forms when available. Presumably, this would add to the complexity of spoke. 

To find a case without these alternative analyses, we turn to Welsh plurals. Welsh has 
a number of ways of forming plurals. For example: 


(17) Singular Plural 
ysgol [dsgol] ‘school’ ysgolion [asgsljon] 
cyfarfod [k'avárvod] ‘meeting’ cyfarfodydd  [k*avarvsdid] 
cynllun [k*Sndin] ‘plan’ cynlluniau [k'oniínjauj] 
problem [p'róblem] ‘problem’ problemau [p'roblémauj] 
panel [p'ánel] ‘panel’ paneli [p*anéli] 
pwnc [p'ógk] ‘subject? pynciau [p'áógk^jau]] 
angen [ágen] ‘need’ anghenion [aghénjon] 
gorchymyn  [goryómin] ‘order’ gorchmynion  [gorymónjon] 
alarch [alary] ‘swan’ elyrch [éliry] 
castell [káste1] ‘castle’ cestyll [k'ésti] 


Note that there are different suffixes and stem changes. 

There is another class of nouns, however, where it is the singular that is marked rather 
than the plural. The singular is always marked with either -yn in the masculine gender 
or -en in the feminine gender. For example: 
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(18) Singular 
mochyn 
blewyn 
morgrugyn 
marworyn 
eginyn 
mefusen 
coeden 
derwen 
madarchen 
moronen 


madaryen] 
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Plural 

‘pig’ moch [m3y] 
‘hair’ blew [bléw] 

ant morgrug  [mórgrig] 

ember marwor  [márwor] 
‘sprout’ egin [gin] 
‘strawberry? mefus [mévis] 
‘tree’ coed [k"»urd] 
‘oak’ derw [déru] 
‘mushroom’ madarch — [mádary] 
‘carrot moron [móron] 


There are some blended cases as well, where nouns marked for the singular co-occur 
with stem changes or take plural suffixes as well. For example: 


(19) Singular Plural 
merlyn [mérlin] ‘pony’ merlod [mérlod] 
oedolyn [»ujd3lin] ‘adult’ oedolion  [»ujd3ljon] 
taten [t'át^en] ‘potato’ tatws [t'át^os] 
(a)deryn [(a)dérin] ‘bird’ adar [adar] 
gwreiddyn  [gwréjóin] ‘root’ gwraidd [gwrájč] 
deilen [déjlen] ‘leaf’ dail [dajl] 
hwyaden [hujáden] ‘duck’ hwyaid [hujajd] 
cneuen [k'néuggn] ‘nut’ cnau [k'naéuq] 


The existence of the pairs where the singular is marked instead of the plural allows us 
to test the number of morphemes prediction without the problems of the English cases 
above.” On one hand, we take nouns which mark the plural with -(i)au, the most frequent 
plural suffix, and no associated stem changes. On the other, we take nouns which mark 
the singular with -en or -yn, and no associated stem changes or plural marking. In other 
words: problem/problemau, etc. vs. coeden/coed, etc. What we're interested in is whether 
there is a difference in the relative frequency of singular and plural forms in the two 
classes as a function of whether the form has an extra morpheme. Since we have both 
types of marking in Welsh, we can do this independent of the relative frequency of 
singulars and plurals in the language. 

Individual lexical items have different frequencies of occurrence, so we must equalize 
for this. We therefore take the ratio of singular to plural as a measure of the relative 
frequency of singular and plural. Since this is a ratio, it abstracts away from the overall 
frequency of each pair. 


? One might counter that it's possible to treat these as instances of subtractive morphology. There are two 
arguments against this. First, the singulatives are always marked with the suffixes -yn or -en (depending 
on gender). Second, there are cases where nouns end in these phonetic sequences where they are not suf- 
fixes. In these cases, normal plural formation occurs. For example: emyn/emynau ‘hymn’, terfyn/terfynau 
‘boundary’, ffenomen/ ffenomenau ‘phenomenon’, awen/awenau ‘inspiration, muse’, etc. 
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For this investigation, we use the CEG corpus (Ellis et al. 2001). This is a tagged written 
corpus of 1223501 words. For each word form, it also includes lemmas, so it is possible 
to determine singular-plural pairs fairly easily. In this corpus, we find 885 distinct pairs 
where the plural is marked with -(i)au and 41 distinct pairs where the singular is marked 
with -en or -yn. (As above, in both cases, we exclude pairs where stem changes are 
involved.) 

When the plural is marked, the ratio of singulars to plurals is 11.08; when the singular 
is marked, the ratio is 1.26. Singulars greatly outnumber plurals that are marked, but 
singulars occur far less frequently when they are marked instead. This difference is 
significant: (920.287) = —8.267, p < .001. This is consistent with the hypothesis that 
forms with more morphemes are more complex. 


6 Marked morphological operations 


Let's now consider the question of whether more marked morphological operations are 
under-represented. If they are, this would be consistent with Anderson's typology and 
Input Optimization. 

To test this, let's look at the distribution of plurals in English using the tagged Brown 
corpus (Kuéera & Francis 1967). The Brown corpus is a fairly old written corpus of 928181 
words. The advantage of using it here is that it is tagged, so identification of singular and 
plural nouns is relatively easy, and it is widely used and available. 

Focusing on plural nouns, we can separate them into regular plurals marked with -s 
vs. other plural forms, e.g. men, stigmata, radii, oxen, etc. When we pair these up with 
their respective singular forms, we get the following overall counts: 


(20) Type Pl. tokens Sg. tokens  Pairs/types 
-S 46083 111904 4266 
Irregular 2891 2517 79 


Overall, there are far more regular than irregular forms, but this is, of course, to be 
expected by the very definition of “irregular”. It is, however, also consistent with Input 
Optimization. The complexity of a system can be enhanced by limiting the number of 
forms that exhibit marked properties. It can also be enhanced by limiting the distribution 
of forms that do have those properties. 

Is there a difference in the likelihood of a plural form given its regularity? If irreg- 
ular forms are more complex, then we would expect their use to be statistically under- 
represented because of Input Optimization. To test this, we calculate the ratio of singular 
to plural tokens for each noun pair. This ratio allows us to examine the relative distri- 
bution of singular and plural forms, abstracting away from the overall frequency of any 
specific lexical item. The difference is plotted in Figure 1. 

Strikingly, the difference goes in the wrong difference here: irregular plurals are more 
frequent relative to their singular forms than regular -s plurals with respect to their 
singular forms. This difference is significant: t(90.318) = 3.151, p = 0.002. 


166 


8 Morphological complexity and Input Optimization 


D 
= 
bo 
_ 
—- o 
d e 
_ 
3 
O 
z= 
D o 
2 k 
D 
E 
o 


o —— DRE 


irregular -S 


Figure 1: Singular-to-plural ratios for regular and irregular plurals in English 


We conclude that the distribution of irregular plurals is ambiguous. In terms of relative 
frequency of singulars and plurals, the distribution goes in the wrong direction. In terms 
of overall distribution, however, it goes in the right direction. There are 2891 instances of 
irregular plurals in Brown and 46083 instances of regular plurals. If we were to assume 
that both types were equally likely, the difference is certainly significant: X?(4344, N = 
48974) = 425346.797, p < .001. 


7 Zero marking 


Let’s now turn to zero marking. The claim is that zero marking is more complex and 
therefore the prediction is that zero marking should be under-represented. 

We examine this with respect to plurals in English in the Brown corpus. Zero-marked 
plurals in English includes examples like: deer, aircraft, buffalo, etc. The difference in 
ratios between regular plurals in -s and zero plurals is shown in Figure 2. 
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Figure 2: Singular-to-plural ratios for regular and zero plurals in English 


Zero-marked plurals are far more frequent—relatively speaking—than regular plu- 
rals. Unfortunately, the variance is quite high—there is a lot of variation within each 
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category—and though the mean difference is large, it is not significant: (19.002) = 
—1.416, p = 0.173. As with the irregular plurals, however, the absolute difference is 
significant. There are 184 instances of zero plurals in Brown and 46083 instances of reg- 
ular plurals. If we were to assume that both types were equally likely, the difference is 
certainly significant: X^(4285, N = 46267) = 311130.115, p < .001. Again then, though 
the relative count is not significant, the absolute count goes in the right direction. 


8 Summary 


Our goal here has been to test the dimensions of morphological complexity proposed 
in Anderson (2015) with the theory of Input Optimization. As reviewed above, Input 
Optimization maintains that grammatical complexity, as assessed through constraint 
violation, is minimized at the input level of the grammar. Specifically, we should see 
under-representation of more marked morphological structures. 

We picked out several dimensions of morphological complexity to examine, some of 
which have already been treated with respect to Input Optimization. The following list 
is repeated from Section 3 and annotated to reflect our results. 


(21) 1 Words should have fewer morphemes. This is borne out by the distribution of 
marked plurals and marked singulars in Welsh. 

2 Haplology should be avoided. (This has already been established by Ham- 
mond 2016.) 

3 More marked morphological operations (per the hierarchy above) should be 
avoided. This was tested with respect to English plurals and is borne out in an 
absolute comparison, but not in a relative comparison. 

4 Morphophonology should be avoided. (This has already been established by 
Hammond 2013.) 

5 Ablaut, umlaut, truncation, etc. should be avoided. (This is essentially the 
same as #3 above.) 

6 Zero-marking should be avoided. This was tested with respect to English plu- 
rals and is borne out in an absolute comparison, but not in a relative comparison. 


First, all else being equal, we expect forms with more morphemes to be dispreferred 
to forms with fewer morphemes. We saw that this was borne out in a comparison of 
singular-plural pairs in Welsh where in some cases the singular has an extra morpheme 
and in others the plural has an extra morpheme. 

Second, we predict that morphological haplology should be under-represented. This 
was established in previous work with respect to the English genitive plural and adjec- 
tives in -ly. 

Third, more marked morphological operations should be under-represented with re- 
spect to less marked morphological constructions. We saw an overall effect here with 
English irregular noun plurals. We also saw that the relative distribution of plurals with 
respect to singulars went in the opposite direction. 


168 


8 Morphological complexity and Input Optimization 


Fourth, we predict that morphophonology should be avoided. This was established in 
previous work with respect to morphophonology associated with English past -ed and 
plural -s. 

The fifth point is the same as the third. 

Finally, zero-marking should be under-represented. We saw an overall effect here with 
English zero-marked noun plurals. We also saw that the relative distribution of plurals 
with respect to singulars went in the opposite direction. 

We conclude that the parameters of complexity developed in Anderson (2015) tested 
here and in previous work are correct. 

We have seen that there is some divergence in the absolute and relative representation 
of plural marking, but we leave investigation of that for future work. 
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In this paper we address an unusual pattern of multiple exponence in Lusoga, a Bantu lan- 
guage spoken in Uganda, which bears on the questions of whether affix order is reducible 
to syntactic structure, whether derivation is always ordered before inflection, and what mo- 
tivates multiple exponence in the first place. In Lusoga, both derivational and inflectional 
categories may be multiply exponed. The trigger of multiple exponence is the reciprocal 
suffix, which optionally triggers the doubling both of preceding derivational suffixes and 
of following inflectional suffixes. In these cases, each of the doubled affixes appear both 
before (closer to the root) and after the reciprocal. We attribute this pattern to restructuring, 
arguing that the inherited Bantu stem consisting of a root + suffixes has been reanalyzed as 
a compound-like structure with two internal constituents, the second headed by the recip- 
rocal morpheme, each potentially undergoing parallel derivation and inflection. 


1 Introduction 


Among the most important contributions of Steve Anderson’s realizational approach 
to morphology have been his early insistence that morphology is not reducible to syn- 
tax, his argument that formal theoretical models of morphology need to take different 
approaches to derivation and inflection (“split morphology”), his development of mor- 
phological rule ordering as the mechanism of ordering affixes, and his postulation that 
redundant (inflectional) morphological exponence is actively avoided by grammars. Ac- 
cording to Anderson (1992), derivational morphology takes place in the lexicon, while 
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inflectional morphology takes place in the syntax. Inflectional morphology is realized by 
the application of ordered rules which spell out features supplied by syntactic principles 
such as agreement. The best evidence that the ordering of inflectional affixes cannot sim- 
ply be read off of syntactic structure comes from morphotactics which have no analogue 
or simple justification in syntax. 

In this paper we address some rather unusual facts from Lusoga, a Bantu language 
spoken in Uganda, which bear on the questions of whether affix order is reducible to 
syntactic structure, and whether derivation is always ordered before inflection, particu- 
larly as concerns multiple exponence. In §2 we introduce the Bantu verb stem and briefly 
summarize what has been said about the ordering of derivational suffixes within it. After 
reviewing the findings that much of this ordering is strictly morphotactic, not following 
from syntactic scope or semantic compositionality, in §3 we discuss multiple exponence 
among the Lusoga derivational verb extensions. In §4 we then turn to the original con- 
tribution of Lusoga, which shows multiple exponence of inflectional agreement as well 
as unexpected intermingling of inflectional and derivational affixation. We present our 
analysis in §5 and conclude with a few final thoughts in §6. 


2 The Bantu verb stem 


Most overviews of the Bantu verb stem assume a structure with an obligatory verb root 
followed by possible derivational suffixes (“extensions”), and ending with an inflectional 
final vowel (FV) morpheme. As shown in (1), the verb stem may in turn be preceded by 
a string of inflectional prefixes to form a word: 


(1) word 
inflectional prefixes stem 


subject-TAM-object-etc. 


root-extensions-FV 


While this structure has been reconstructed for Proto-Bantu (Meeussen 1967), there is 
much variation on how the different derivational “verb extensions” are ordered. As 
shown in Hyman (2003b), most Bantu languages show at least a tendency to favor the 
“CARP” template in (2), for which we give reflexes in several Bantu languages: 


(2) C(ausative) A(pplicative)  R(eciprocal)  P(assive) 
Shona -is- -il- -an- -w- 
Makua -ih- -il- -an- -iw- 
Chichewa -its- -ir- -an- -idw- 
Lusoga -is- -ir- -agan- -(ib)w- 
Proto-Bantu *-IC- *-1d- *-an- *-IC-o- 


The arguments for recognizing the CARP template include the following: 
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(i) Certain pairs of co-occurring suffixes must appear in a fixed surface order. This 
is true of the causative + applicative (CA), which can co-occur only in this order, inde- 
pendent of their relative scope. Compare the following two examples from Chichewa, 
in which applicative -ir- introduces an instrument (Hyman & Mchombo 1992; Hyman 
2003b). Scope (schematized on the right) varies across the two examples, but surface 
order is the same: 


(3) a. applicativized causative: 
lil-its-ir- ‘cause to cry with’ [[ cry ] -cause-with ] 

b. causativized applicative: 
takas-its-ir- ‘cause to stir with’ [[ stir-with ] -cause ] 


(ii) Non-templatic orders which are driven by scope can occur with certain sets of suf- 
fixes, but are typically limited and show a “compositional asymmetry”: The a-templatic 
order is restricted to the reading in which the surface order corresponds to relative scope, 
while the templatic order can be interpreted with either possible scope relations (e.g. re- 
ciprocalized causative, causativized reciprocal). The two orders of causative and recipro- 
cal (CR, RC) illustrate this property in (4), again from Chichewa: 


(4 a templatic CR: 
mang-its-an- ‘cause each other to tie’ [[ tie ] -cause-e.o. ] 
‘cause to tie each other’ [[tie-e.o. ] -cause ] 
b. a-templatic RC: 
mang-an-its- ‘cause to tie each other’ [[ tie-e.o. ] -cause ] 
*'cause each other to tie’ 


As seen in (4a), the templatic CR order allows either scope interpretation, while the a- 
templatic RC order in (4b) can only be used to express a causativized reciprocal. The same 
facts are observed in cases where the a-templatic order of applicative and reciprocal is 
reinforced by an A-B-A “copied” sequence: 


(5 a. templatic AR: 
mang-ir-an- ‘tie (sth.) for each other’  [ [ tie ] -for-e.o. ] 
‘tie each other for (s.o.)’ _[ [ tie-e.o. ] -for ] 
b. a-templatic RAR: 
mang-an-ir-an- ‘tie each otherfor(s.o) _[ [ tie-e.o. ] -for ] 
* “tie (sth.) for each other’ 


Again, as seen in (5a), the templatic AR order can have either scope (reciprocalized 
applicative, applicativized reciprocal), while in (5b) the a-templatic (RA) + copy (R) se- 
quence can only be compositional, hence an applicativized reciprocal. (We will see such 
A-B-A sequences in Lusoga in §3.) 

(iii) A third argument for CARP is that at least one language, Chimwiini, allows only 
this order, whereas no Bantu language allows verb extensions to be freely ordered by 
scope. Thus, Abasheikh (1978: 28) writes: 
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“In Chimwi:ni, unlike some other Bantu languages, the order of the extensions is 
restricted. The following ordering of the extensions mentioned above is as follows: 
- Verb Stem - Causative - Applied - Reciprocal - Passive. It is not possible to put 
these extensions in any other order? 


Other than stative -ik-, which is more restricted in its co-occurrence with other suffixes, 
the above summarizes the general picture for the productive extensions which are in- 
volved in valence. Even given the occasional variations, e.g. Kitharaka (Muriungi 2003), 
which reverses the applicative and reciprocal, hence the order CRAP, the evidence points 
unequivocally to the fact that extension order is determined primarily by template. 

The importance of templaticity is also seen from the existence of one other valence- 
related suffix, the short causative -i- (I) which typically occurs between the reciprocal and 
passive, hence CARIP (see also Bastin 1986, Good 2005). Although both *-rc- (> -rs-, -is-) 
and *-i- were present in Proto-Bantu, *-rc- occurred only in combination with *-i-, hence 
*-1c-i- (cf. Bastin (1986). However, as summarized in (6), the current distribution of the 
two extensions (as well as the productivity of -i-) varies considerably across different 
Bantu languages (Hyman 2003b: 261): 


(6) a. -is-i- and -i- : Kinande, Luganda, Lusoga 
b. -is- only : Chichewa, Shona, Zulu 
c. -i-only (or almost only) : Nyamwezi, Nyakyusa 


The fact that -is- is the linearly first extension and -i- a quite later extension in CARIP, for 
reasons not motivated by scope, presents one more reason to accept a templatic, rather 
than compositional approach to Bantu verb extensions. However, this conclusion is not 
without interesting complications. As shown in such studies as Hyman (1994; 2003a) 
and Downing (2005), -i- frequently produces frication of a preceding consonant (a.k.a. 
Bantu spirantization) with potential multiple (cyclic) effects, as seen from the following 
examples in which -i- co-occurs with the (non-fricativizing) applicative -il- suffix in (7) 
from Cibemba: 


(7) lub- ‘be lost’ lil- ‘cry’ UR 
lub-i- ‘lose’ lil-i- ‘make cry’ Morphology (I) 
luf-i- lis-i- Phonology 
luf-il-i- ‘lose for/at’ lis-il-i- ‘make cry for/at Morphology (A) 
luf-is-i- lis-is-i- Phonology 


In both outputs, the applicative and short causative exhibit the expected surface AI 
order. However, the frication of lub- ‘be lost’ and lil- ‘cry’ to luf- and lis- suggests that at 
some level of representation, -i- is root adjacent. Hyman (1994) adopts the above cyclic 
analysis in which morphology and phonology are interleaved (see e.g. Kiparsky 1982): 
-i- combines with the root on the first morphological cycle, triggering a phonological 
application of frication on the root. When the applicative is added on the next cycle of 
morphology, it is “interfixed” between the root and the short causative, in conformity 
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with the AI order required by the CARIP template. This example illustrates the surface 
nature of the template. 

Although it is not part of the CARIP template of valence-changing derivational suf- 
fixes, the “final vowel” (FV) inflectional ending position is also templatic in that it is 
required in most Bantu languages. The set of suffixes that may appear in the FV position 
includes past tense *-r-, subjunctive *-e, and (in most other contexts) default *-a. The -€ 
portion of perfective *-il-e, which we will encounter in §4, is also in this slot, even as 
the -il- portion is sometimes considered to be part of the extension system. The custom- 
ary reason for assuming bimorphemic *-il-e is that the short causative (I) and passive 
(P) occur between the two parts, hence *-il-i-e and "-il-o-e (Bastin 1983). If we assumed 
that *-il-e was monomorphemic, we would have to assume some kind of exfixation or 
metathesis of the causative and passive with the [il] portion of -ile. There is a second 
argument from Lusoga (and Luganda): Whenever causative -i- or passive -u- is present, 
the FV of the perfective complex is -a (see (9) and note 5 below). We assume that -il- 
occurs in the template ordered before -I-P- with the function of perfectivizing the ex- 
tended derivational base so it can accept -e or -a (cf. §4).) With this established, we are 
ready to go on to the issues that arise in Lusoga. 


3 Lusoga verb extensions 


As mentioned above, Lusoga is spoken in Uganda and is the Bantu language most closely 
related to Luganda. The data cited in this study were contributed by Fr. Fred Jenga, a 
native speaker from Wairaka (Jinja District). 


3.1 Long and Short Causatives 


Lusoga exhibits the CARIP template discussed above, where C refers to the long causat- 
ive -is- and I refers to the short causative extension -i-. In fact, Lusoga uses both -is-i- and 
-i- productively and often interchangeably, to express both causation and instrumentals: 
-lim-is-i-, -lim-i- ‘cause to cultivate, cultivate with (sth.)'. As indicated, -is- cannot occur 
without -i-, while the reverse is possible. The two causative morphs are quite consistent 
in their CARIP templatic ordering with respect to the applicative, namely, -is-il-i- (CAT), 
-il-i- (AD), which are realized as -is-iz- and -iz- by the following processes: 


(8) ‘make cultivate for/at’ 
lim-is-il-i-a  lim-il-i-a UR 
lim-is-iz-i-ca  lim-iz-ia  frication 
lim-is-iz-y-a  lim-iz-y-a gliding 
lim-is-iz-a lim-iz-a glide-absorption 
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3.2 Reciprocal + Short Causative 


Challenges to the CARIP template arise with the reciprocal suffix, which in Lusoga has 
the long reflex -agan- of Proto-Bantu *-an-.! In the next few subsections we will consider 
how the reciprocal combines with its fellow extensions in the CARIP template, including 
both ordering flexibility as well as affix doubling. 

We begin with the short causative, -i-. When used alone, without the long causative, 
we observe flexible ordering possibilities, well beyond what would be expected from the 
CARIP template. In these and subsequent examples, a left bracket indicates the boundary 
between inflectional prefixes and the beginning of the verb stem: 


(9) ‘they make each other sew’ 
a. bà-[tüüng-ágán-y-á  /tüüng-agan-i-a/ RI 
b. bà-[tüünz-ágán-á /tüüng-i-agan-a/ IR 
c. bà-[tüünz-ágán-y-á —/tüüng-i-agan-i-a/ IRI 


In none of (9a-c) does the short causative -i- surface as a vowel. Nonetheless, its presence 
is clearly felt. In (9a) it glides, preceding a following vowel; in (9b) and (9c) it spirantizes 
the final /g/ of /-tuung-/ ‘sew’ to [z] by a general process in the language, and is otherwise 
deleted before the following vowel (of the reciprocal). The reciprocal suffix -agan- does 
not trigger compensatory lengthening when vowels glide or delete before it, as also seen 
in the examples with root-final vowels immediately followed by -agan-, below: 


(10) a. bà-[mw-àgán-á ‘they shave each other’ /-mo-/ ‘shave’ 
b. bà-[ty-àgán-á ‘they fear each other’ /-ti-/ ‘fear’ 


Note that (9c) appears to exhibit two instances of the short causative: root spirantization 
indicates a following short causative, and the glide following the reciprocal also indicates 
a following short causative. These two surface reflexes of the short causative could result 
from input suffix doubling, something that is attested elsewhere in Lusoga, as shown in 
the UR given for (9c). Alternatively, the double reflex of the short causative could be the 
result of a-templatic IR order, in which the single short causative spirantizes the root 
and then the reciprocal is interfixed inside of it, an analysis Hyman has supported for 
Chibemba (7). On this account, short causative doubling (IRI) is illusory. We leave open 
for now whether the IRI ordering is required; what is clear is that both RI and IR are 
possible. 


3.3 Reciprocal + Long Causative 


We turn next to the long causative -is-, which, as we have seen, must co-occur with the 
short causative -i-. The most common realization when reciprocal and long causative 


! While it is marginally possible for the reciprocal and passive to co-occur in some Bantu languages, typically 
with an impersonal subject, e.g. Ndebele kw-a-sik-w-an-a ~ kw-a-sik-an-w-a ‘there was stabbing [stabbed] 
of each other’ (Sibanda 2004: 66), we have thus far not been able to get the two to co-occur in Lusoga and 
will therefore ignore the passive extension in what follows. 
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are both present is for -agan- to appear between -is- and -i-, as in (11a), exhibiting the 
CRI order expected given the CARIP template. However, two other surface realizations 
are also possible:? 


(11) ‘they make each other sew’ 
a. bà-[tüüng-ís-ágán-y-á  /tüüng-is-agan--a/ CRI 
b. bà-[tàüng-ís-ágán-á /tüüng-is-i-agan-a/ CIR 
c. bà-[tüüng-ágán-ís-á /tüüng-agan-is-i-a/ RCI 


In (11b), -agan- follows -is-i- (CIR). In (11c) -agan- precedes -is-i- (RCI). This variation re- 
veals the same freedom with respect to the ordering of the long causative and reciprocal 
extensions that we observed in $3.2 with respect to the ordering of the short causative 
and reciprocal extensions. 

Note that for phonological reasons, it is impossible to distinguish between the inputs 
-is- and -is-i- before -agan-. The reason is that, sandwiched between long causative -is- 
and following vowel-initial -agan-, short causative -i- would glide to -y- and then get 
absorbed into the preceding [s], without leaving a trace. As was seen in (10), compen- 
satory lengthening is not expected before -agan-. However, it can be detected between 
-i- and a FV when an enclitic such as locative class 17 =ko ‘on it, a little’ is added: 


(12) ‘they make each other sew a little’ 
a. bà-[tüüng-Ís-ágán-y-áá -kó  /tüüng-is-i-agan-i-a -kó/ CIRI + encl 
b. bà-[tüüng-Ís-ágan-á -kó /tüüng-is-i-agan-a -kó/ CIR + encl 
c. bà-[tüüng-ágán-ís-áá -kó /tüüng-agan-is-i-a -kó/ RCI + encl 


In (12a), the final length on -aa can be directly attributed to the gliding of the preceding 
-i-, since there is a surface [y], as can be the final length in (12c), where the glide has 
been absorbed into the preceding [s]. Although (12b) does not show a surface reflex of 
the internal -i-, we continue to assume that -is- must be accompanied by -i-, as also 
reconstructed for Proto-Bantu (Bastin 1986). 

While there are three possible realizations when reciprocal -agan- combines with the 
long and short causative suffixes, the preferred surface orders are IRI in (9c), and CRI, 
in (11a). RI and CRI are of course predicted straightforwardly from CARIP, while the 
IR of IRI is not. Both early placement of C (-is-) in the CARIP template and the early 
realization of the first -i- of the hypothesized a-templatic IRI ordering discussed in this 
section are consistent with a generalization that Hyman (2003b: 272) has characterized 
as “causativize first!”: Both -is- and -i- are spelled out early, but later affixation may 
result in two surface reflexes of -i-, either because of interfixation of subsequently added 
extension suffixes or because of outright morphological -i- doubling of the kind seen in 
the Chichewa RAR case illustrated in (5b). 


? Since Lusoga has a /L/ vs. @ tone system (Hyman 2016), only L(ow) vowels are marked with a grave accent 
in underlying forms. Vowels without an accent receive their surface tones by specific rules. H(igh) tone is 
marked with an acute in output forms. 
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3.4 Reciprocal + Applicative 


The CARIP template is complicated further by the behavior of the applicative, repre- 
sented by “A” in CARIP. In all three of the following examples, the transitive verb kub- 
‘beat’ is both reciprocalized ‘beat each other’ and applicativized. Applicative -ir- licenses 
a locative argument, expressed by the enclitic =wa ‘where’. Here again we observe alter- 
native affix orders: 


(13) ‘where do they beat each other?’ 
a. ba-[kub-ir-agan-a =wa AR 
b. bà-[küb-ágán-ír-á =wa RA 
c. ba-[kub-ir-agan-ir-a -wà ARA 


(13a) represents the expected AR order of CARIP, while the RA order of (13b) represents 
an order which is closer to the compositional interpretation of the resulting verb. In 
(13c) -ir-agan-ir- has both the AR and RA orders. The variation between AR, RA and 
ARA orders represents a competition between the demand of the CARIP template for 
one order and the requirement for affixes to appear in a surface order that reflects their 
relative scope. The AR order (13a) is templatic; the RA order in (13b) is a scope-based or 
compositional override. As suggested by Hyman (2003b), ABA affix doubling can thus be 
interpreted as a means of satisfying both template and compositionality considerations; 
ifthe template wants AR and scope wants RA, then ARA, in some manner, satisfies both.? 

An illustrative pair of examples is presented in (14), based on the transitive verb bal- 
‘count’, which is reciprocalized and applicativized. In this instance, applicative -ir- li- 
censes a benefactive object: 


(14) a. ba-bi-[bal-ir-agan-4 AR ‘they count them [inanimate class 8] for each 
other’ 
b. bà-tü-[bál-ír-ágán-á AR ‘they [animate] count each other for us’ ~ 
‘they count us for each other’ 


By varying the animacy of the object pronouns in (14), it is possible to bias the scope 
interpretation of reciprocal and applicative in opposite directions. In (14a) the object pre- 
fix -bi- ‘them’ (class 8) represents an inanimate object such as ébitabo ‘books’ or ébikopo 
‘cups’, hence animate each other’ (referring back to bà- ‘they’) claims the benefactive 
rôle over inanimate -bi- ‘them’. In this sentence the AR order -ir-agan- satisfies both the 
CARIP template and scope: [[count them] for each other]. In (14b), animate first person 
object -tà- ‘us’ preferentially claims the benefactive role over third person -agan-, again 


3 The questions in (13) unambiguously ask where the action took place and could therefore be answered “in 
Jinga” or “in the house". The absence of the applicative in the corresponding question bá-[küb-agan-a =wa 
‘where do they beat each other?’ more narrowly asks what spot or area of the body was hit. An appropriate 
answer would therefore be “on the head”. Finally, the double reflex of applicative -ir- of ARA -ir-agan-ir- 
in (13c) is reminiscent of the double reflex of RAR -an-ir-an- in Chichewa in (5c): the sequence -ir-agan- 
is licensed by CARIP, while -agan-ir- represents the scope override. Concerning ABA suffix ordering, one 
might note that Lusoga (13c) violates Hyman’s 2003 generalization, observable in Chichewa (5c), that AB 
always reflects the scope, while BA is templatic. 
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referring back to ba- ‘they’. The -ir-agan- order in this sentence is also templatic, but 
this time need not reflect scope: Although the preferred interpretation is [[count each 
other] for us], the other scope ([[count us] for each other]) is also possible, though prag- 
matically less likely. It is thus not surprising that the two alternatives are also possible 
in (15) with the same meaning: 


(15) ‘they [animate] count each other for us’ 
a. ba-tu-[bal-agan-ir-a RA 
b. bà-tü-[bál-ír-ágán-ír-áà ARA 


Parallel to (12b,c), (15a) is a scope override, while -ir-agan-ir- satisfies both CARIP and 
scope in (15b). What is surprising is that the same possibilities are at least marginally 
acceptable in (16), both sentences having the same meaning. 


(16) ‘they count them [inanimate cl. 8] for each other’ 
a.  bà-bi-[bál-ágán-ír-á RA 
b. ?? bà-bi-[bál-ír-ágán-ír-á | ARA 


As in (15a,b), the RA sequence occurs perfectly well in (16a), while the doubled RAR 
sequence in (16b) was judged as sounding “Lugandish,’ perhaps OK to use, but seems 
a little funny, “like a foreigner learning Lusoga? While we have an explanation for the 
variation in (13b,c) and (15a,b), neither CARIP nor scope predicts that (16a,b) should be 
possible. We thus arrive at a major divergence from the template + scope approach that 
accounts for the variations considered above in Lusoga, as well as Chichewa, Chibemba, 
and other Bantu languages. We now address why this may be so in the next section. 


4 Inflectional FV suffixes in Lusoga 


In §3 we were largely able to account for surface variations in verb extension order in 
Lusoga by appealing to a tradeoff between the CARIP template and scope considerations: 
While the templatic CARIP is always available and represents the default order of affixes, 
conflicting orders may be licensed by scope, and template-scope interactions can even 
result in ABA sequences. The one major exception concerns cases of atemplatic (A)RA 
-(ir)-agan-ir-, in which a-templatic RA -agan-ir- cannot be said to be a compositional 
override. In this section we show that this unexpected ordering likely owes its existence 
to an optional restructuring of reciprocal -agan-. 

To illuminate this hypothesis, we now turn to the interaction of reciprocal -agan- with 
the set of complementary inflectional “final vowel” (FV) suffixes. Every verb must end in 
one of these. While most verbs end in the default FV -a, specific TAM categories require 
one of two other finals, the FV -e or the FV complex -ir-e, which have the following 
distributions: 
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(17) a. “irrealis” -e  :hortative/subjunctive, affirmative imperative singular 
with an object prefix, affirmative imperative plural, 
negative near future (F1) 

: perfect/today past (P1), yesterday past (P2) 


: elsewhere 


b. “perfective”  -ir-e 
c. “default” -a 


As summarized in (17a) and exemplified in (18), what unifies the uses of -e is its use in a 
subset of irrealis constructions: 


(18) a. bi-[bál-é ‘count them" (singular imperative with an object 
prefix; cf. bàl-à ‘count!’) 
b. mu-[bal-é ‘count (pL) (plural imperative) 
c. tü-[bál-é "let's count!’ (hortative/subjunctive) 


d. ti-bá-á-[bál-é ‘they will not count’ (negative near future F1) 


As per the general Bantu stem structure in (1), the FV follows the verb extensions, e.g. 
applicative -ir- in (19). 


(19) a bi-tu-[bal-ir-€ ‘count them for us!’ 
b. mu-tu-[bal-ir-é ‘count (pl.) for us!’ 
c tü-bà-[bàl-ír-é —*let's count for them!’ 
d. ti-bá-á-tó-[bál-ir-é ‘they will not count for us’ 


However, two options are attested when the extension is -agan-: 


(20) a. u-[bal-agan-é 


u-[bal-agan-é 


u-[bal-é-gan-é 
u-[bal-é-gan-é 


‘count each other!’ 

‘let’s count each other!’ 

‘they will not count each other’ 
‘count (pl.) each other!’ 

‘let’s count each other!’ 


ù-[ 
ù-[ 
tì- E 
ù-[ 
ù-[ 
tì- p: -[ 


bál-é-gàn-é ‘they will not count each other’ 

The expected forms are in (20a), where reciprocal -agan- is followed by FV -e. Surpris- 
ingly, the alternatives in (20b) show the FV -e occurring both before and after the recip- 
rocal. In these forms we have segmented off the first FV as -e-, which means that the 
reciprocal allomorph is -gan- in this context. The alternative would be to recognize a re- 
ciprocal allomorph -egan- which is used whenever there is an upcoming FV -e.* We will 
see in the discussion of perfective -ir-e below that the first -e- is correctly interpreted as 
a copy agreeing with the final -e. 

The same variation obtains when the applicative suffix is present: 


^ Tt is important to note that -e-gan- cannot be used if the FV is -a: 0-kü-[bál-ágán-á ‘to count each other’, 
ba-[bal-agan-a ‘they count each other’ vs. *ò-kú-[bál-é-gán-á, *ba-[bal-é-gan-a. 
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(21) a. mu-bi-[bal-ir-agan-é ‘count (pl) them for each other!’ 

tu-bi-[bal-ir-agan-é ^ "let's count them for each other!’ 
ti-ba-a-bi-[bal-ir-agan-é ‘they will not count them for each other’ 

b. mu-bi-[bal-ir-é-gan-é ‘count (pl.) them for each other!’ 

tu-bi-[bal-ir-é-gan-é "let's count them for each other!’ 
ti-ba-a-bi-[bal-ir-é-gan-é ‘they will not count them for each other’ 


In (21), the applicative -ir- precedes the reciprocal, showing the AR order predicted by 
the CARIP template, but the presence of the FV between the two in the forms in (21b) is 
highly unusual from a Bantu point of view. 

Exactly the same phenomenon of FV doubling occurs with the perfective -ir-e FV 
complex. As in Luganda, Lusoga -ir-e has several allomorphs. These are presented in 
(22) in the form they take prior to the application of phonological rules? 


(22) a. -ire  : after a CV- verb root 


b.  -i- ... -e : when fused (“imbricated”) into a longer verb base 
c. -i-e : after a labial consonant and /n/ 
d. -re : after a fricated consonant [s] or [z], where -i- —^ y —^ Ø 


The above four allomorphs are illustrated in the perfect/today past (P1) tense below: 


(23) /tu-[ti-ir-e/ —  tü-[ti-ir-é ‘we feared’ 
/tü-[tomer-i-e/ —  tü-[tóméir-é ‘we ran into (s.o./sth.)’ 
/tà- [tüm-i-e/ —  tü-[tüm-y-é ‘we sent’ 

ü-[ 


/tà-[bal-i-e/ > baz-é ‘we counted’ 


aot Pp 


tu 
tu 


In (23a), the /-ir-e/ allomorph is realized after the CV verb /-ti-/ ‘fear’. In (23b), longer 
verb bases that end in a coronal consonant undergo imbrication whereby -i- metathesizes 
with the consonant. We will see in further examples that the reciprocal -agan- extension 
also undergoes imbrication to become -again-e. In (23c), the /-i-/ of /-i-e/ glides to [y]. In 
(23d), -i- fricates the preceding /l/ to [z], yielding the same derivation as in (8): /-bal-i-e/ 
— baz-i-e — baz-y-e — baz-e, the [y] being absorbed into the preceding fricative. 

We will now illustrate each of the above allomorphs of -ir-e in (23) as they are realized 
with the reciprocal extension. We start with the reciprocalized version of (23b), which 
exhibits the imbricating -i-e perfective FV allomorph. The historically conservative vari- 
ant, in which the root is followed directly by the reciprocal suffix and then the -i-e FV, is 


5 As was discussed at the end of §2 with respect to Proto-Bantu, we represent -ir-e as bimorphemic. In (22) 
we omit the passive and causative forms that occur with final -a, thereby providing even more allomorphs, 
e.g. the perfective of the lexicalized passive verb /-lum-u-/‘be in pain’ is tu-[lum-iir-w-a ‘s/he was in pain’, 
while the perfect of the lexicalized causative verb /-tém-i-/ ‘blink’ is tà-[tém-ííz-à ‘we blinked’, where r — z 
is triggered by the causative suffix /-i-/. Both occur with a long -iir- morph followed by -a. As seen in these 
examples, the fact that -a is used with passive -u- and causative -i- provides additional evidence that -ir- 
is a separate morpheme from -e or -a. 

The following -e actually lengthens, but then is shortened by a rule of final vowel shortening (FVS), which 


a 


converts d-lim-y-éé to à-lím-y-é. Thus compare the long vowel in éi) Vines éé -kó which is realized when 
an enclitic follows. d indicates a downstepped high tone). 
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shown in (24a). However, the preferred alternative is (24b), in which the perfective -i-e 
appears, imbricated, both immediately following the root AND immediately following 
the reciprocal. URs showing both a single and a doubled FV complex are provided for 
each form: 


(24) ‘we ran into each other’ 
a. /tu-[tomer-agan-i-e/ tü-[tómér-àgàin-é 
b. /tü-[tomer-i-e-agan-i-e/  tu-[toméir-é-gain-é 


A parallel situation obtains in (25), which corresponds to (23c): 


(25) ‘we sent each other’ 
a. /tu-[tum-agan-i-e/ tü-[tüm-àgàin-é 
b. /tü-[tüm-i-e-agan-i-e/ —tü-[tüm-y-é-gàin-é 


Example (26), based on (23d), shows similar facts, the main difference being the frication 
triggered by causative -i- on the verb root -bal- ‘count’: 


(26) ‘we counted each other’ 
a. /tu-[bal-agan-i-e/ tu-[bal-again-é 
b. /tü-[bali-e-gan-i-e/  tu-[baz-é-gain-é 


Finally, in (27), we see a reciprocalized version of the root in (23a), which, on its own, 
would take the -ir-e FV allomorph. The historical variant is shown in (27a), but the 
preferred variant, with doubled FV, is given in (27b): 


(27) ‘we feared each other’ 
a. /tu-[ti-agan-i-e/ tu-ty-again-é 
b. /tu-[ti-ir-e-gan-i-e/  tü-ti-ir-é-gàin-é 


As before there are two instances of the perfective in (27b), vs. one in (27a). In this case 
of doubling, however, the allomorphy of the perfective is different in the two copies. The 
first copy of the FV follows a CV root and assumes the expected -ir-e form; the second 
copy, following the longer -agan-, assumes the imbricating -i-e form. The fact that the 
allomorphs are different suggests that the two copies are generated independently. 

In sum, both the irrealis -e FV and the perfective FV allomorphs can appear once in a 
reciprocalized verb, or twice, with the double spell-out being clearly preferred. We now 
turn to an analysis of these facts in $5. 


5 Towards an analysis 


From the perspective of familiar cross-linguistic principles of affix ordering (derivation 
closer to the root than inflection; prohibition on multiple exponence), Lusoga presents 
two interesting puzzles: (i) derivational and inflectional suffixes both double; (ii) when 
inflectional suffixes double, they do so on either side of derivation, violating the "split 
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morphology" hypothesis. Thus, in a form like tu-[bal-é-gan-é ‘let’s count each other’ 
from (20b), the irrealis FV -e occurs both before and after the derivational reciprocal 
suffix -gan-. While doubling of derivational suffixes has been previously discussed in the 
Bantu literature (Hyman 2003b), the doubling of inflection has not. This is the final focus 
of this study. Given that the doubling occurs in verbs containing the reciprocal suffix 
-agan-, the question we face is what it is about this suffix that triggers the phenomenon. 
Why is it only the reciprocal that does this? 

Our hypothesis is that the phonological form of the reciprocal has led to a reanalysis 
of the internal morphological structure of the reciprocalized Lusoga verb stem. The re- 
ciprocal suffix -agan- is the only Lusoga derivational suffix which is both disyllabic and 
a-initial. Taken together, these phonological facts are consistent with a reanalysis of the 
verb stem in which the reciprocal suffix is bimorphemic, -a-gan. Because of its phonolog- 
ical identity, the -a- portion became identified with the default FV -a. At the same time 
this permitted the reanalyzed reciprocal suffix, -gan-, to conform to the default -CVC- 
verb root structure. 

As a result of this reanalysis, the verb structure in (28a) became reinterpreted as in 
(28b), where we use # to indicate the internal stem boundary: 


(28) a. Expected (inherited) ^ b. Unexpected (innovated) 
Roor-Reciprocal-FV Root-FV#Reciprocal-FV 
Roor-agan-a Roor-a£gan-a 


From this step, the following analogical reanalyses follow straightforwardly, with allo- 
morph variation in (29b) conditioned by the phonological size and shape of the root: 


(29) a. Expected (inherited) b. Unexpected (innovated) 
Roor-agan-e Root-e#gan-e (irrealis) 
Roor-agan-ir-e Roor-ir-e£gan-ir-e (perfective) 


In (29a) inflectional -e and -ir-e are suffixed after derivational -agan-. (We show the 
perfective as -ir-e in the above, although its exact allomorph will vary, as pointed out 
in (22).) In (29b) we see the reanalysis brought on by analogy. As a result, from the 
simple right-branching suffixing construction in (29a), reciprocal verb stems became 
reanalyzed, optionally, as compounding, with two roots: the verb root, and -gan-. Both 
are inflectable (29b), though it is possible also to inflect only the verb stem as a whole 
(29a). 

As indicated, the compounding account allows us to account for the apparent affixa- 
tion of the inflectional suffixes -e and -ir-e inside of a derivational suffix, the restructured 
reciprocal -gan-. These suffixes also potentially precede the short causative -i-. The in- 
flection of stems containing both -(a)gan- and the short causative is seen in the following 
six alternants, based on the causative verb -lum-i- ‘injure’, where -i- glides to [y] before 
the following vowel:’ 


7 Although the verb root -lùm- means ‘bite’, the semantics of the lexicalized causative verb -lim-i- ‘injure, 
cause pain’ is most clearly seen in the corresponding lexicalized passive verb -lùm-ù- ‘to ache, be in pain’. 
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(30) ‘let’s injure each other’ 
a. tü-[lüm-y-ágàn-é 
tu-[lum-agan-y-é 
tu-[lum-y-agan-y-é 
tu-[lum-y-é-gan-é 
tu-[lum-é-gan-y-é 
tu-[lum-y-é-gan-y-é 
The options in (30a) all follow the expected parsing, with -agan- treated as a derivational 
suffix. Those in (30b) represent the claimed restructuring in which the FV -e occurs both 
before and after reciprocal -gan-. In each set, causative -i- appears immediately after the 
root in the first example, after the reciprocal in the second, and both before and after 
in the third. In the last two examples of (30b), the first (inflectional) -e occurs not only 
before -gan-, but also before the (derivational) causative -i- suffix. Parallel cases could be 
illustrated in which -i- combines with the various perfective allomorphs. Our analysis, 
which assumes a double or compound stem structure, each of which is independently 
inflected, thus nicely accounts for the above (and other) cases where the inflectional 
FV linearly precedes (restructured) reciprocal -gan- and potentially other derivational 
suffixes. 


(31) 


Roor-(ext;)-(FV;) gan-ext;-FV; 


Before moving on to our conclusion, we briefly cite phonological evidence for our 
analysis from closely related Lulamogi, which also optionally realizes the inflectional 
FV both before and after reciprocal -gan- (Hyman In press). In this language, there are 
two facts concerning vowel length and (pre-)penultimate position that are relevant to 
the analysis of the reciprocal. First, a word-initial V- prefix lengthens if it is followed by 
a monosyllabic stem (i.e. if it is in penultimate position). This is seen in (32a): 


(32) a. /a-[ti-a/ — aa-[ty-A ‘s/he fears’ 
b. /ba-[ti-Cà/ | —  ba-[ty-4 ‘they fear’ 
c. 


/a-[sék-a/ —>  à-[sék-à ‘s/he laughs’ 


As seen in (32b), if the word-initial prefix has the shape CV-, its vowel doesn’t lengthen, 
while in (32c) /a-/ fails to lengthen because it is in pre-penultimate position. The second 
length-related phenomenon is exemplified in (33): 
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(33) a. /tu-[a-ti-a/ —  tw-áá-[ty-à ‘we will fear’ 
b. /tu-á-[sek-a/ —>  tw-á-[sék-á ‘we will laugh’ 


In (33a), the prefix sequence /tu-á-/ (IPL-FUT) undergoes gliding + compensatory length- 
ening to be realized [tw-áá-] in penultimate position. In (33b), on the other hand, the 
same gliding process applies, but the result is short [tw-a-], since prefixal vowel se- 
quences are realized short in pre-penultimate position. 

A systematic exception to both penultimate prefixal V-lengthening and pre-penulti- 
mate prefixal V+V shortening occurs when reciprocal -agan- is suffixed to a monosyllabic 
verb root: 


(34) a. aa-[ty-agan-a ‘s/he often fears’ 
b. tw-áá-[ty-àgàn-á ‘we will fear each other’ 


In (34a), where -agan- is used as a frequentative suffix, the initial subject prefix à- length- 
ens even though it is in pre-penultimate position. In (34b), the [tw-aa-] sequence remains 
long even though it too is in pre-penultimate position. Note also that the first vowel of 
the -ty-àgàn- sequence is short, i.e. compensatory lengthening appears not to apply. All 
of these observations can be accounted for if we assume the same analysis as in Lusoga: 


(35) a. /a-ti-a#gan-a/ —  aa-ty-agan-a ‘s/he often fears’ 
b. /tu-á-ti-agan-a/  —  tw-áá-ty-àgàn-á ‘we will fear each other’ 


In (35) the # symbol again represents the boundary between the two stems. The result in 
(35a) is that the initial /a-/ is now in penultimate position in the first stem and is thus free 
to lengthen. In (35b) the /tu-á-/ is now also in penultimate position, and so [tw-aa-] fails 
to shorten. Taken alone, either our Lusoga analysis or this Lulamogi analysis of Hyman 
(In press) might seem overly speculative—and especially surprising from a traditional 
Bantu perspective. However, taken together, the two sets of facts support each other. In 
fact, Lulamogi is the only other Bantu language we are aware of that allows the option 
of spelling out the FV both before and after the reciprocal extension. Thus compare the 
following with Lusoga (20a,b): 


(36) "let's count each other’ 
a. tüá-[bàl-àgàn-é 
b. tü-[bàl-é-gàn-é 


As we stated earlier, we think this reconceptualization is due to the fact that -agan- is the 
only highly productive suffix that could be re-interpreted in the way we have suggested. 
It is significant that the historical Bantu reciprocal suffix *-an- often joins with other 
suffixes to make a -VCVC- conglomerate (cf. Bostoen & Nzang-Bie 2010: 1289-91 for 
further discussion). In Lusoga, Lulamogi, Luganda, and many other Bantu languages, 
*-an- has joined with an archaic *-ang- or *-ag- extension which likely had an original 
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pluractional interpretation. As we have suggested, the shape and “weightiness” of the 
resulting -agan- has led to multiple exponence and inflectional “entrapment” within the 
derivational morphology of the verb stem in Lusoga (and Lulamogi). We consider further 
implications in the next section. 


6 Conclusion 


In the preceding sections we have documented multiple exponence of derivational suf- 
fixes (§3) and inflectional suffixes (§4) in Lusoga, and have proposed a restructuring 
analysis of *-agan- > -a-gan- in §5 to account for the multiple copies of the inflectional 
FV in -e-gan- sequences. Harris & Faarlund (2006) discuss instances in which grammati- 
calization of an outer affix “traps” an inner one, with the result that the two affixes occur 
in an unexpected order. Loss of the trapped affix is an attested diachronic repair for this 
“entrapment” situation; doubling (by addition of an outer inflectional affix) is another. 
Lusoga, however, appears to illustrate reanalysis of a different kind, in which an exist- 
ing affix is reanalyzed as a root, and doubling represents agreement in a compounding- 
like structure of the sort proposed by Inkelas & Zoll (2005) for reduplication, in which 
doubled morphemes can also show divergent allomorphy of the kind displayed by the 
perfective complex in Lusoga. If correct, the Lusoga facts are important both from a 
synchronic and diachronic point of view. An historical change of “affix > root would 
contradict the more broadly attested grammaticalization pattern “root > affix (but see 
Norde 2009). Synchronically, multiple exponence of the inflectional ending is quite dif- 
ferent from the doubling of derivational suffixes. While the latter has been interpreted as 
the resolution of a template-scope mismatch, perhaps spelled out cyclically, this cannot 
work for inflectional doubling. In the examples in (30) above, it was seen that the deriva- 
tional causative -i- can appear once or twice: It can appear either before the reciprocal 
(-i-agan-), after it (-agan-i-), or both before or after (-i-agan-i-). However, we have thus 
only shown two possibilities concerning inflectional FVs such as subjunctive -e. In (20), 
repeated as (37a,b), we saw that -e can appear either after -agan- or both before and after 
-agan-: 


(37) a. ü-[bàl-ágàn-é ‘count each other!’ 
ü-[bàl-ágàn-é ^ "let's count each other!’ 
ti- s a-[bál-àgàn-é ‘they will not count each other’ 
b. u-[bal-é-gan-é ‘count (pl.) each other!’ 
u-[bal-é-gan-é "let's count each other!’ 
ti- bid a-[bal-é-gan-4 ‘they will not count each other’ 
c. u-[bal-é-gan-4 ‘count each other!’ 
u-[bal-é-gan-4 "let's count each other!’ 
*ti- p a-[bal-é-gan-4 ‘they will not count each other’ 


8 While the most general realization of the reciprocal is -agan- in Luganda, the form is regularly -annan- 
after CV verb roots, e.g. mw-annan- ‘shave each other’. Since -annan- derives from *-angan- via Meinhof's 
Law (Katamba & Hyman 1991: 192-193), this provides evidence that the earlier bimorphemic form was 
likely *-ang-an- in all three closely related languages. 
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However, (37c) shows that it is not possible to express the inflection only on the first 
stem. These facts motivate the compounding structure we have offered for the Lusoga 
verb stem, and suggest that the second member, on which inflection is obligatory, is the 
head, and agreement in derivational and inflectional properties is optionally enforced, 
explaining the presence of duplicate morphology on the first constituent. The structures 
in (37) are not amenable to a cyclic analysis proceeding bottom-up from the verb root. 

In Lusoga, compounding, derivation and inflection are intermingled in typologically 
unusual ways. The complexities of the system - and of multiple exponence in general 
(Anderson 2015: 21) - give credence to views in which morphology is a component of 
grammar with its own internal morphotactic organization; it does not mirror syntax 
directly and thus cannot be reduced to syntactic principles. This is a result of which we 
think Steve would approve. 
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Chapter 10 


Romansh allomorphy (Again!) 


Martin Maiden 
Oxford University 


This essay resumes a debate which has continued for some years between me and Stephen 
Anderson regarding the correct analysis of a complex set of data from the verb morphol- 
ogy of the Romansh dialect of Savognin. Anderson believes that the data are an example 
of “phonologically conditioned allomorphy", whilst I maintain that they exemplify “mor- 
phomic", or autonomously morphological, alternation patterns, whose only phonological 
motivation lies in diacrhrony. I reply below to Anderson’s most recent analysis of the data, 
by discussing reasons in support of my “morphomic” account. I conclude, however, by con- 
sidering the possibility that our two accounts may be too exclusivist in their respective 
“phonologizing” and “morphologizing” stances, and that they are not necessarily wholly 
incompatible. 


1 Introduction 


Some readers might regard this essay as an example of chutzpah, or downright imperti- 
nence, but it is a sincere mark of respect for Steve Anderson that I feel able to disagree 
with him even in a collection published in his honour. One would not do this with a 
less intellectually generous scholar. What follows is, in fact, a further instalment in an 
amicable difference of opinion I have had with him for some years (see Anderson 2008; 
Anderson 2011b; Maiden 2011b; Anderson 2013) concerning the analysis of a set of data 
from some Romansh dialects, and principally, the Surmiran variety spoken in Savognin. 
Readers, and not least the honorand himself, may be feeling that there is little left to say. 
That there is reason to continue the debate is suggested, for example, by Andrea Sims’ 
deft review of the issues (Sims 2015: 202-206), to which I return in my conclusion. 
Anderson's analysis displays not only his characteristically penetrating theoretical 
rigour, but also a quite formidable grasp of the data. Reasons of space, and the fact 
that the data are lucidly laid out by him in previous publications (e.g., Anderson 2008, 
2011) permit me to do no more here than summarize them: Savognin has a recurrent pat- 
tern of vocalic alternations] in the verb root (or “stem”), such that one alternant occurs 
in the root of the singular and third person forms of the present indicative imperative, 
throughout the present subjunctive, and also in third conjugation infinitives, while an- 
other occurs in the root in the remainder of the paradigm. What is involved originates 
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as alternation in vowel quality, phonologically conditioned by stress. The distinctions 
between stressed and unstressed positions may additionally manifest as differences in 
the number of syllables in the root, and in sundry consonantal alternations, including 
metathesis. In fact, what Savognin (and Romansh generally) exhibits is an unusually 
florid manifestation (in respect of the range of different alternation types involved) of a 
pattern of alternation recurrently attested across Romance languages, and I have argued 
extensively (e.g., Maiden 2005; 2011c) that it arose historically because of stress-related 
vowel differentiation, but then became “autonomously morphological” or “morphomic” 
in nature, being no longer determined by stress, but simply by the heterogeneous set of 
paradigm cells, one property of which is that they are "rhizotonic" (i.e., stressed on the 
lexical root). This set I label (for reasons that are here unimportant) the "N-pattern". Im- 
portant diachronic proof that the N-pattern is independent of phonological causation is 
the rise in various Romance languages of alternation patterns whose phonological con- 
tent cannot possibly ever have been determined by stress (including lexical suppletion), 
but which replicates the pattern originally *etched out" by the effects of stress. The 
distribution of alternation found in Savognin constitutes a widespread variant of this 
“N-pattern”. ! There is no space to recapitulate all the data and arguments, but crucially 
Anderson (e.g., 2010: 25;2011: 3462013: 10,16,23;2011: 173) accepts the “morphomic” na- 
ture of the "N-pattern" and its importance in determining morphological change in the 
Romance verb generally. He believes, however, that modern Savognin (and other Ro- 
mansh varieties: cf. Anderson 2013) is, in effect, a "special case", indeed a prime example 
of ‘phonologically conditioned allomorphy’. He convincingly shows that the alternation 
types involved have often become so disparate, and refractory to unique underlying rep- 
resentation, that they must be represented directly in the grammar, but he also argues 
that the alternant-pairs characteristically contain one member more suited to appear un- 
der stress, and another more suited to appear in unstressed position (Anderson 2011b: 18), 
and that the alternants are accordingly conditioned by the presence vs absence of stress. 
He also shows that the position of stress in Savognin is systematically predictable from 
the segmental content of the word-form. It follows that the alternations are fully pre- 
dictable from the position of stress, and that appeal to the “N-pattern” is inappropriate. 
My response, in nuce, has been that there exist nonetheless within Savognin morphologi- 
cal phenomena that really are irreducibly “morphomic” and follow the N-pattern. Given 
this, I say that Anderson misses an important generalization by divorcing the vocalic 
alternations from clear-cut cases of the N-pattern. Anderson’s response is, in effect, that 
my alleged N-pattern examples are secondary effects of the principle of stress-related 
allomorph selection, and that invocation of the morphome risks missing another signifi- 
cant generalization, namely that the alleged stress-related alternations found in the verb 
are found across the grammar, outside the verb. 

I need to comment briefly on a methodological assumption. Steve Anderson objected 
to me orally some years ago (cf. also Anderson 2011b: 13f.;17) that I could not draw in- 


1 Savognin is among a number of Romance dialects where the N-pattern alternant also appears in the first 
and second persons plural present subjunctive. The reasons are complex and not immediately relevant here 
(cf. Maiden 2012). 
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ferences about Savognin from apparently parallel developments in other Romance lan- 
guages for which my *morphomic" analysis seemed correct, observing quite reasonably 
that the inhabitants of Savognin were “not Romance linguists” (and therefore could not 
know about what happens in other Romance languages). My very delayed response is, 
in effect: “Yes, but they are still Romance speakers”. Let us suppose that in some other 
Romance variety, distant both historically and geographically (a real case is Romanian), 
virtually identical patterns of alternation are found, except that this time they are clearly 
morphomic; let us further assume that the analysis appropriate to, say, Romanian is per- 
fectly possible for Romansh, even though rival possible accounts exist. Obviously native 
speakers of Romansh are not native speakers of Romanian, nor are they Romance lin- 
guists enjoying Olympian vistas over comparative Romance morphology: nothing we 
could say about Romanian could ever be definitively probative for Romansh. What does 
not follow, however, is that the comparative evidence can be ignored. Speakers of both 
languages obviously have the same mental endowment, and both languages have inher- 
ited much that is structurally common, particularly with regard to the organization of 
the inflexional morphology of the verb. What this means is that an analysis which is 
justified for Romanian deserves consideration as plausible for Romansh. The Romance 
languages ought not to be treated as hermetically isolated entities: rather, the analysis 
of one variety should always be allowed to inform that of another. That, in fact, is one of 
the reasons for doing Romance linguistics from a comparative perspective (in fact, there 
is no other way of doing it), and in the following pages the analysis will frequently be 
guided, with all due caution, by comparisons and inferences across cognate but separate 
varieties. I now examine the facts which seem to me to require acknowledgement of the 
morphomic N-pattern in Savognin. 


2 Suppletion: dueir and deir 
The verb dueir ‘must, be obliged to (cf. German sollen) (from Latin DEBERE) plays a 


central role in the debate, because it has a suppletive root allomorph in precisely that set 
of cells which, in other verbs, displays the “stressed” alternant (Table 1): 


Table 1: Dueir in Savognin 


INF duéir 


PSTPART duia 


GER duond 

(Ee 2sG EES IPL 2PL 3PL 
PRS.IND stó stóst stó duágn duéz stón 
PRS.SBJV stopiga  stóptgas stoptga  stóptgan  stóptgas stoptgan 
IPF.IND duéva | duévas | duéva | duévan duévas  duévan 


COND duéss duéssas | duéssa duéssan | duéssas | duéssan 


191 


Martin Maiden 


The suppletive forms are taken from another verb, stueir ‘must, be necessarily the case 
(cf. German miissen)’, which unlike dueir continues to have its own complete paradigm 
(Table 2): 


Table 2: Stueir in Savognin 


INF stuéir 


PST.PART  stuia 


GER stuond 

1sG 2sG 3sc 1PL 2PL 3PL 
PRS.IND stó stost stó stuágn stuéz stón 
PRS.SBJV stóptga  stóptgas stóptga stóptgan stóptgas stóptgan 
IPF stuéva stuévas stuéva stuévan stuévas stuévan 


COND stuéss stuéssas stuéssa stuéssan  stuéssas stuéssan 


For Anderson (2008; 2010) dueir is a defective verb and its pattern of alternation is 
a matter of phonologically conditioned allomorphy: he believes that the explanation of 
the suppletion is that dueir lacks a stressed stem-alternant, having only unstressed /do/. 
Since /do/ contains a vowel whose phonological characteristics debar it from occurring 
under stress, speakers in effect plug the resultant phonological gap by borrowing ap- 
propriate stressed forms from a near synonym of dueir, namely stueir. My view, from 
comparative and diachronic evidence, is that it is highly unlikely that dueir could ever 
have been, in any relevant sense, "defective", and that even if it were, the filling of the al- 
leged gap could have nothing to do with phonology. Indeed, any explanation in terms of 
phonological conditioning crucially fails to account for the fine details of the allomorphy. 
If I am correct, and what we observe is a pattern of allomorphy identical in distribution 
to the vocalic alternations, yet independent of phonology, then in principle whatever 
explains the paradigmatic distribution of forms of dueir should also be available to ex- 
plain the vocalic alternations. Indeed, considerations of economy would lead us to prefer 
that single explanation. This is a view that I have expounded before (e.g., Maiden 2011b: 
46-49), while Anderson, in his latest discussion 2013: 8 states that there are no new 
facts, and that we simply disagree. I think that the facts remain very important, and I 
(re-)present them below in a slightly revised form. 

The 'defectiveness' of dueir is the effect, not the cause, of the suppletion. All sup- 
pletive verbs whose morphology reflects the incursion of forms of the paradigm of one 
lexeme on the paradigm of another are, in one sense, "defective". If, for example, Grisch 
(1939: 89f.n5), DRG s.v. dovair, p.378, Decurtins (1958: 155;158), or Signorell, Wuethrich- 
Grisch & Simeon (1987: 165f), describe Romansh reflexes of DEBERE as “defective”, this 
simply means that there are parts of the paradigm occupied by forms which are patently 
unconnected (diachronically or synchronically) with dueir, and which obviously are con- 
nected with stueir. This does not mean that the paradigm of the lexeme meaning ‘must’ 
somehow has “holes” in it. 
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One might object that the independent existence of stueir as a verb with its own full 
paradigm (and indeed with its own distinctive meaning, as I explain shortly) is grounds 
to view those forms of it which appear inside dueir as synchronic interlopers drafted in 
to occupy otherwise “empty territory”. Such reasoning would force us into the highly 
counterintuitive position of claiming, for example, that Savognin esser ‘to be’ is “defec- 
tive” in respect of its past participle, because the latter has the form sto which is also 
(and transparently) the past participle of a different verb, star ‘to stand’. This is actually 
a case where there was, historically, defectiveness: the Latin ancestor of esser had no 
past participle, for semantic reasons. It is only with the rise in early Romance of verbal 
periphrases comprising auxiliary + past participle, that the verb ‘be’ needs to fill the past 
participle “slot”, and it does so (in some regions) by supplying the missing form from the 
past participle of sTARE ‘stand’. But the idea that, in modern Romansh, the verb "be 
lacks a past participle and “borrows” it from star seems peculiar, given that the verb ‘be’ 
itself, and the use of forms of it involving its past participle, are utterly basic. Indeed, to 
the best of my knowledge no grammarian of the Romance languages has ever described 
the wholesale suppletion of ‘be’ in the Romance languages as involving “defectiveness”. 
If one can analyse Savognin esser as suppletive but not defective, then surely the same 
analysis should be available for dueir. 

My principal difficulty with Anderson’s analysis of dueir is that I see absolutely no 
motivation to view this verb as defective, beyond the morphological facts which are the 
explanandum. All its cells are well and truly filled — and it is almost inconceivable that 
a subset of present-tense cells of a verb expressing such a basic meaning could ever be 
empty. Here, again, a comparative perspective is useful. Virtually all Romance languages 
conserve reflexes of DEBERE, with a full inflexional paradigm, and no Romance languages 
show any sign of defectiveness in the verb meaning ‘must’, whatever its origin: there is 
no reason for parts of its paradigm to be missing. Many Romansh dialects indeed have 
a full paradigm all of whose forms still continue DEBERE (see Decurtins 1958: 152f. DRG 
s.v. dovair), and the rhizotonic forms (usually dé-) are robustly attested from the earliest 
records, including in central dialects (of which Savognin is one); cf. also Anderson (2010: 
30;32). There is simply nothing in the phonology of these dialects, either synchronically 
or diachronically, which could have determined deletion of such stressed forms, and the 
defectiveness certainly cannot be explained as a phonological effect of stress.’ 

Anderson (2010: 32) suggests that “the primary factor in the emergence of defective- 
ness in Surmiran dueir, as well as the complementary pattern in the Engadine languages, 
was the morphologization of the vowel alternations in Swiss Rumantsch. If we hypoth- 
esize that this was combined with reduced use of the verb due to competition with oth- 
ers such as stueir, it could well have led to the present situation with only one stem 


2 Could a stress-based account be salvaged if, unlike Anderson, one said that any kind of alternation, in- 
cluding an alternation where one of the alternants was zero, could be effected by stress? Given that the 
position of stress in Savognin is predictable on grounds of segmental phonological content, one can hardly 
invoke the case of zero alternants which would, by definition, lack any segmental content. The best one 
could say is that zero forms appear in those parts of the paradigm where stress would normally be expected 
to appear. But then one would have to ask: “Where would stress normally be expected to appear?", and 
the answer would be purely morphological: “the N-pattern". 
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conserved”. I discuss later the nature of the “competition” from stueir, which involves 
overlap and replacement, not defectiveness. As for the alternation, regular sound change 
would indeed have given rise to a unique alternation between a stressed alternant /de/, 
and unstressed alternant /du/ (in the latter the back vowel is the result of an adjust- 
ment of the unstressed front vowel triggered in the environment of a following labial 
consonant: cf. Italian 3sc.PRs.IND déve vs INF dovére). But the notion that Romansh 
would eliminate an alternation type because it was “morphologized” (or, perhaps bet- 
ter, idiosyncratic and unpredictable), especially in such a high-frequency verb, seems 
unlikely, particularly given that Romansh is notable for retaining extreme and idiosyn- 
cratic patterns of vocalic alternation, even in isolated verbs which surely have a much 
lower frequency of use. Rogers (1972), in an analysis of Surselvan, lists no fewer than 
eleven sets of vocalic alternation each apparently limited to just one verb, with meanings 
such as ‘vomit’, ‘scythe’, ‘drivel’ - all without sign of resort to defectiveness; see also my 
discussion of Savognin deir ‘say’, below. One might add that the most natural response 
to any idiosyncratic type of alternation would surely be not to create a “gap”, by jetti- 
soning one alternant, but to attempt some kind of “repair” by analogically remodelling 
the alternants to be less different? The last thing one expects is a reaction resulting in 
an alternation which, by virtue of being suppletive, is even stranger than the rejected 
original. 

Viewed in a comparative-historical perspective, the notion that the Savognin reflexes 
of DEBERE could be in any significant sense “defective” seems most unlikely. And even if 
it were defective, the suppletive filling of the alleged gaps would take place because the 
gaps needed filling, not specifically because of the lack of a “stressed” alternant. What 
that perspective does suggest, however, is a different scenario (see also Maiden 2011b: 
46-49), which involves not “gaps” but “overabundance” (cf. Thornton 2011), in which 
more than one realization became available for certain cells of the paradigm of dueir, 
and in which one of the alternative realizations ultimately prevails. This situation arises 
from particular discourse-related circumstances. The reflexes of DEBERE are subject in 
Romansh and beyond to intensive competition from other nearly synonymous alterna- 
tives. I have no space to detail the facts or the mechanisms (see Maiden 2011b),° but 
essentially what appears to be involved is “face-saving”: speakers avoid the charge of 
moral obligation inherent especially in present indicative forms of dueir by resorting to 
alternatives such as expressions equivalent to ‘ought’ (e.g., conditional forms of the verb), 
or expressions meaning “absolute” (rather then “moral”) necessity, which is exactly ex- 
pressed by stueir. This tendency created, I suggested, a situation in which the frequent 
use of stueir alongside dueir in the present tense led to effective synonymy between the 
two forms, eventually resolved by replacing dueir with stueir according to the familiar 
pattern of alternation (the “N-pattern”) associated with vocalic allomorphy (a type of 


3 In any case, Savognin stems do sometimes have “inappropriate” forms. Thus Anderson (2011b: 32) dis- 
cusses verbs such baitár ‘babble’ which has a stem suitable for stress only, but which is is nonetheless used 
throughout the paradigm in unstressed environments as well. 

4 See further Stürzinger (1879: 49); Tagliavini (1926: 84); Kramer (1976: 64); Maiden (2011a) 

5 In addition to the sources cited by Maiden (2011a), see also Pult (1897: 166f.) for the suppletive introduction 
of forms of the verb ‘want’ in the dialect of Sent. 
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reaction to synonymy attested elsewhere in Romance: ct Maiden 2004; 2006). The same 
paradigmatic distribution, reflexes of DEBERE alternating this time with those of conu- 
ENIRE (originally meaning ‘be fitting’), emerges from ALDII (maps 829-833; 836-838), 
for Peio (point 54) and S. Michele all’Adige (point 66). The disappearance of reflexes of 
DEBERE from certain cells of the present indicative and the present subjunctive of dueir 
never involved “defectiveness”, and has never had anything to do with phonology. The 
perception of “defectiveness” is a synchronic effect of the suppletion. 

Crucially, the suppletive fusion of dueir and stueir in Savognin is of a significantly 
different kind from the alleged phonologically conditioned allomorphy of the vocalic 
alternations. The latter is a binary correspondence between alternants and stress: one 
alternant is selected under stress, the other elsewhere. The putative relation between the 
dueir - stueir alternation and stress can only be described as binary at a level which is in 
fact lacking any phonological content. For what, in the case of the suppletion, is allegedly 
selected by stress is not a form correlated with stress, but a whole array of phonologi- 
cally and morphologically different forms. As Anderson himself points out 2008: 124,° 
“it is not just a single stem, but the full range of irregular forms of stueir (ia stò, te stost, el 
sto, els ston; Subjunctive ia stoptga, etc.) that replaces those of dueir where stress would 
fall on the stem”.’ More exactly: “the first and second person singular, and third person, 
forms of the indicative of stueir are mapped onto the first and second person singular, 
and third person, forms of the indicative of dueir, and the present subjunctive cells of 
stueir are mapped onto the corresponding cells of dueir". Only this way do we get the 
observed distribution. In effect, it is not “a stem”, but an entire, morphomically defined, 
“slab”, of the paradigm of stueir, a set of full word-forms replete with their own internal 
allomorphic idiosyncrasies, that has been mapped onto dueir. A rule of phonologically 
conditioned allomorphy involving stress could in principle select a stressed root allo- 
morph of stueir and introduce it into dueir, but it could not necessarily insert the right 
root allomorph in the relevant cell. A rule that identifies a morphologically-defined por- 
tion of the paradigm as that in which the replacement of one lexeme by the other can 
do just that. 

Dueir exemplifies lexical suppletion (“incursion’, in the terminology of Corbett 2007), 
where one historically distinct lexeme obtrudes on the inflexional paradigm of another 
lexeme. Another diachronic route to suppletion is regular sound change so extreme 
in its effects that the synchronic result is allomorphy such that the alternants bear no 
phonological relation to each other. Savognin has at least one case of phonologically 
induced suppletion, namely deir ‘say’, which inflects as in Table 3: 


5 Anderson (2010: 29) even calls this verb “suppletive”. 

7 As mentioned earlier, if we ask “where stress would fall on the stem", the answer is ineluctably morpho- 
logical: in the cells of the singular and third person present and imperative and in the present subjunctive. 
It seems to me useless to say, instead, “wherever the endings would be unstressed” because the third per- 
son singular has no ending, and the distribution of the remaining, unstressed, endings turns out to be the 
morphomic N-pattern. 
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Table 3: Deir in Savognin 


PST.PART  détg 


GER schónd 

1sG 2sG 3sc 1PL 2PL 3PL 
PRS.IND déi déist déi schágn schéz déian 
PRS.SBJV  schéia | schéias ^ schéia ^ schéian ` schéias ` schéian 
IPF schéva | schévas | schéva | schévan | schévas | schévan 


COND schéss schéssas | schéssa | schéssan  schéssas | schéssan 


I cannot here retrace the phonological history of this verb in detail. Suffice to say 
that the historically underlying root was *dik-, and that sound changes involving, in par- 
ticular, deletion of unstressed vowels and assimilation of resulting consonant clusters, 
created the modern situation. The rather unusual present subjunctive of this verb hap- 
pens to show the effects of analogical levelling in favour of the originally arrhizotonic 
first and second person plural form stems togther with the associated stressed inflexional 
ending (cf. Decurtins 1958 for the reflexes of AMBULARE / IRE, DICERE, UENIRE, HABERE, OF 
SAPERE in Samedan, Parsons, and Razen for other Romansh examples of this kind; further 
discussion also in Maiden, in progress). That aside (and there is every reason to believe 
that the déi- root originally occurred as expected in the present subjunctive), this verb 
shows N-pattern suppletion. Anderson (2011b: 17) acknowledges that it is “genuinely 
suppletive" and that "the choice of stem is determined by morphosyntactic features". He 
defines in the same way some other, less radically suppletive verbs, for which I give here 
just the present-tense forms, e.g, (vu)léir ‘want’, néir ‘come’ (Table 4): 


Table 4: (Vu)leir and neir in Savognin 


PRS.IND vi vót vót léin léz vóttan 


PRS. SBJV viglia  víglias viglia —víglian ` víglias ` víglian 


PRS.IND vign vígnst vigna nin níz vígnan 


PRS.SBJV  vígna  vígnas  vígna  vígnan  vígnas  vígnan 


If we acknowledge that deir and some other verbs have (near-)suppletive patterns de- 
termined synchronically by morphosyntactic features, then we have to admit the pres- 
ence of the morphomic N-pattern in Savognin. Yet if we say that the vocalic alternations 
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are still a matter of “phonologically conditioned allomorphy”, then the fact that they 
show exactly the same paradigmatic distribution becomes uncomfortably coincidental. 


3 The “augment” 


The “augment” is a functionally empty formative which, in certain cells of the inflexional 
paradigm of the verb, occurs between the lexical root and desinences denoting tense, 
mood, person, and number (for detailed discussions of its nature and origins, which lie 
in Latin and proto-Romance Aktionsart suffixes, see especially Maiden 2003;2011: 249- 
53;2016: 715f.). In Latin, the relevant affixes were restricted to imperfective-aspect forms, 
but had no restrictions for person, number, or tense. In most Romance languages, aug- 
ments are associated with particular inflexion classes (in Romansh, usually the first and 
fourth conjugations), and have become restricted to certain cells of the inflexional pa- 
radigm defined by tense, mood, person, and number. In Savognin, the augment occurs 
solely in the singular and third person forms of the present indicative, and in all forms 
of the present subjunctive. That is to say, of course, that it has exactly the same paradig- 
matic distribution as the "stressed" vocalic alternants, a fact which clearly suggests a link 
between them. Thus first conjugation luschardár ‘strut’, and fourth conjugation tradéir 
‘betray’ (Table 5): 


Table 5: The augment in Savognin first and fourth conjugation verbs 


1sG 2sG 3sc 1PL 2PL 3PL 


First conjugation 


PRS.IND — luschardésch ` luschardéschas —luschardéscha ` luschardágn luschardéz luschardéschan 
PRS. SBJV  luschardéscha ` luschardéschas ` luschardéscha ` luschardéschan ` luschardéschas ` luschardéschan 
IPF.IND luschardéva luschardévas luschardéva luschardévan luschardévas luschardévan 


Fourth conjugation 


PRS.IND — tradésch tradéschas tradéscha tradígn tradíz tradéschan 
PRS.SBJV  tradéscha tradéschas tradéscha tradéschan tradéschas tradéschan 
IPF.IND tradíva tradívas tradíva tradívan tradívas tradívan 


It is undisputed that the distribution of the Romance augment cannot be explained, 
diachronically or in modern varieties, as the output of any kind of phonological process. 
The view that I have developed (see, e.g., Maiden 2005; 2011b,c) is that the redistribu- 
tion of the alternant from Latin to Romance is purely morphologically determined, and 
reflects sensitivity to a paradigmatic pattern created, originally, as an effect of vocalic 
alternations between stressed and unstressed vowels: the same pattern can be shown 
to have provided a "template" for the suppletive merger of distinct lexical verbs in var- 
ious Romance languages (notably, the verb ‘go’). I see no reason why what we see in 
Savognin, and more generally in Romansh, should be viewed any differently: the distri- 
bution of the augment appears a matter of pure morphology, and given that the vocalic 
alternations have the same distribution as the augment, they too can be accounted for 
in the same, purely morphological, terms. 
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Anderson views the facts, in effect, in terms of a kind of “defectiveness”: verbs show- 
ing the augment lack a stressed vocalic alternant, and the augment is inserted wherever 
this occurs. Since the augment is inherently stressed, the preceding root-form must be 
unstressed, and the lack of a stressed root allomorph is thereby resolved. My view is 
that this analysis inverts cause and effect: it is not the case that the augment is applied 
because there is no stressed root allomorph but, rather, that there is no stressed root al- 
lomorph because the relevant cells of the paradigm are specified as being filled by forms 
containing the augment. This latter analysis has the immediate advantage of avoiding 
the problem of arbitrary stipulation of defectiveness in one set of cells only. After all, if 
stressed alternants can be defective, why should not unstressed alternants too? Why do 
we not also find, that is, verbs with a stressed alternant but not an unstressed one? And 
if the distribution of the augment is dictated by the need to plug a phonological “gap”, 
how is it that such gaps only occur in first and fourth conjugation verbs, precisely the 
inflexion classes to which the augments are historically restricted across the Romance? 

Discussion of the Savognin augment has tended to focus on first conjugation verbs, 
where it is most productive, but where it still only constitutes a subset (and apparently a 
minority) of such verbs. We should not forget that the augment also appears in the great 
majority of fourth conjugation verbs (characterized by infinitives in -eir), a class com- 
prising dozens of lexemes and endowed with some productivity. If we follow Anderson, 
this means that almost all of the fourth conjugation is characterized by lack of a stressed 
alternant. Nothing logically excludes this, but it seems counterintuitive to say that a 
major, semi-productive, inflexion class is, in effect, “defective” in most of its present 
tense. Nobody would countenance such an analysis for the cognate Romance varieties 
(Daco-Romance, Italo-Romance, Catalan) where the fourth conjugation behaves in this 
way. 

Anderson (2011b: 22) points out that the augment frequently appears in neologisms, 
including where speakers feel doubt about the identity of the stressed allomorph. This 
does not mean, however, that the augment is usually a response to perceived lack of a 
stressed alternant. Using the Savognin first conjugation verb luschardar ‘strut’ (exem- 
plified above), Anderson (2008: 122) observes that: "The use of this pattern [...] has the 
advantage that the speaker does not need to retrieve any information about the specific 
alternation pattern of the stem in order to produce all of the correct forms. Otherwise, 
it would be necessary to choose [...] among a variety of possibilities such as "luscharda, 
*luscheirda, * luschorda, *laschurda, *laschorda, etc. Each of these patterns is more or less 
secure with reference to at least some verbs in the Surmiran lexicon, but the availabil- 
ity of the paradigm [given above] makes it possible to avoid the choice when positive 
evidence is not readily available.” The problem here is that that there is unequivocal ev- 
idence for the stressed vowel. This verb is transparently and directly derived from the 
nominal luschard ‘dandy, fop, vain, proud’, which actually contains, moreover, a highly 
frequent stressed pejorative suffix -árd. In this case, the identity of the "right" stressed 
alternant is patent. This is in fact true of a large number of other verbs that take -esch, 
all transparently derived from nouns or adjectives whose stressed vowel is known (ex- 
amples from Signorell 2001), such as those give in Table 6: 
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Table 6: Nouns, adjectives, and derived verbs in Savognin 


Basic noun/adjective Infinitive 3sc.prs.ind 

cisél ciselár ciseléscha ‘chisel’ 
dimóra? dimorár dimoréscha ‘dwell (-ing)’ 
discrédit discreditar discreditéscha ‘discredit’ 
fax faxár faxéscha ‘fax’ 

figura figurar figuréscha ‘figure’ 

film filmár filméscha ‘film’ 

firma firmár firméscha 'sign (-ature)' 
guíd guidar guidéscha ‘guide’ 

liber liberar liberéscha ‘free’ 

nivél nivelár niveléscha ‘level’ 

ódi odiiér odiéscha ‘hate’ 

penél penelár peneléscha ‘paint (-brush)’ 
schicána schicanár schicanéscha ‘chicane’ 
teléfon telefonár telefonéscha ‘telephone’ 
uniform uniformar uniforméscha ‘uniform’ 
vagabund vagabundár vagabundéscha ‘bum’ 


What such derived forms lack is not a “stressed” alternant but an unstressed one: what 
appears in the verb is simply the root of the corresponding rhizotonic nominal form. 
Yet there is no sign of attempts to invent a predictable “unstressed” counterpart for the 
stressed vowel of the base-form (e.g., (Nr ““udiiér from the noun ódi after the model 
of 3sc.PRs.IND dórma ‘sleeps’ vs INF durméir) and quite simply the derived verb-forms 
preserve the segmental identity of the base form. Many scholars have suggested that in 
Romance generally the augment serves to obviate root allomorphy that might otherwise 
occur in the lexical root. There are various reasons why this view does not account 
for the facts (cf. Maiden 2011c: 251f.), but note that in any case this kind of “solution” 
comports a paradox: one type of alternation is merely replaced by another, that between 
the augment and its absence, the augment itself retaining an irreducibly “N-pattern”, 
morphomic, distribution. 

Anderson (2013)broadens his survey beyond Savognin, arguing for a similar analysis 
for other Romansh varieties. In fact, in dialects where stress has a somewhat different 
distribution from Savognin, the augment duly follows that distribution. Thus Anderson 
(2013: 21f.), in Vaz (Valbella; data from Ebneter 1981) the first person plural present indica- 
tive, in addition to arrhizotonic forms, as in Savognin, also has rhizotonic forms with un- 
stressed desinence -an, and the predicted “stressed” stem: e.g., INF amblidár ‘forget’, 1sc 


8 In fact, dimóra and firma below may be derived from the corresponding verbs (cf. Thornton 2004: 517, and 
Cortelazzo & Paolo 1988 ss.vv. dimorare, firmare for this type in Italian). If this holds for Savognin, then 
these verbs do possess a 'stressed' alternant. 
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amblóid, (et, amblidáin, but INF amprastar ‘lend’, 1sc amprést, 1PL ampréstan. Sometimes, 
the same verb has two possible first person plural present indicative forms, e.g.: INF 
gudair ‘enjoy’, 1sc giód, 1PL gudáin / giódan. In verbs taking the augment, there are, cor- 
respondingly, forms in 1Pr. -áin without augment (e.g., INF adorár ‘adore’, 1sc adorésch, 
1Pr adoráin), and forms in 1P1 unstressed -an duly showing it (e.g., INF habitár ‘inhabit’, 
1sc habitésch, 1p. habitéschan). According to Anderson (2013: 23) such behaviour poses a 
"problem" for the morphomic account, because “it is fairly clear that the stem alternation 
and the appearance of -esch [...] are tied directly to the position of stress, even where this 
is potentially variable, and not to a fixed set of morphological categories". With respect 
to Anderson, I think that it poses no such problem: all it shows is that whatever principle 
governs the stressed root allomorph also governs the augment. In this particular case, 
in fact, we are not dealing with a change in position of stress at all: rather, we have a 
syncretism such that the first person plural form tends to be "taken over" by the third 
person plural. This is quite systematic in Vaz (and elsewhere), and occurs independently 
of stem stress (for example, in non-present forms). 

More revealing is the case presented in Maiden (2011b: 45f.), of distributional discrep- 
ancy between augment and stressed vocalic alternant. The dialects of the Val Müstair (see 
Schorta 1938: 132) tend to place stress on the root of the infinitive in all conjugations.? 
Indeed, they are unique among Romance languages in generally having rhizotonic in- 
finitives even in the first conjugation, as shown in Table 7: 


Table 7: Rhizotony in Val Müstair first conjugation infinitives 


ARARE >  'aror ‘plough’ 


CAPTARE > 'catər ‘find’ 

FILARE > filer ‘spin’ 
IEIUNARE > jaynor ‘fast’ 
LAETARE >  laidor ‘spread dung’ 
PESCARE > ‘pefcar ‘fish’ 
SCOPARE > Tkuar ‘sweep’ 
tittare > 'tetor ‘suckle’ 
IANTARE >  jaintor(orjantar) ‘breakfast’ 
telefonare > _ telefonar ‘telephone’ 


Schorta states, however, that root stress in first conjugation infinitives systematically 
fails to happen in that class of first conjugation verbs having the element -aj-, which I 


? Some second conjugation verbs are exceptions. 

10 This phenomenon is mentioned by Stürzinger (1879: 35) and Huonder (1901: 518f.), and is amply confirmed 
by data from Val Müstair covered by ALDI/II. See also Grisch (1939: 222) for Vaz, Candrian (1900: 51) for 
Stalla, Solér (1991: 135) for Schams. 
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identify as the augment." Here, stress always remains on the ending of the infinitive: 
e.g., INF betjar ‘baptize’, 3sc.prs betjaja; INF biar ‘build’, 3sc.prs biaja; INF gujar ‘dare’, 
3sc.PRs gujaja. The same holds of fourth conjugation infinitives, but apparently only 
if they belong to that minority of verbs that lack the augment: Schorta cites INF finir 
finish’, a verb which takes the augment; compare this with, e.g., INF bwoAar ‘boil’ a 
verb that does not show the augment). Now the most likely explanation of why the 
augment does not appear in the infinitive here is that, in Romance languages generally, 
root-stress in infinitives is limited to third conjugation verbs (cf. Maiden 2011c: 201f. 
2016: 509), all other classes having non-rhizotonic infinitives. The augment, however, is 
characteristic solely of the fourth and first conjugations, not the third. The third, while 
a relatively small and unproductive class, contains some of the semantically most basic, 
and highest frequency, verbs, and the appearance of root stress in the Val Müstair fourth 
and first conjugations is almost certainly modelled, therefore, on the third conjugation. 
If no augment can appear in first conjugation infinitives, it is because the distribution 
of the augment is morphologically specified, and that specification happens to exclude 
infinitives. 

In Maiden (2011b) I argued as follows: if the function of the augment is in effect to 
supplete the absence of a stressed root-alternant, and if infinitives in the Val Müstair are 
generally root-stressed, then we should expect verbs with the augment duly to show that 
augment in the infinitive, in lieu of root-stress (e.g., INF ““be'tjajar rather than the actu- 
ally occurring be'tjar). The fact that the augment never appears in the infinitive therefore 
also suggests that its distribution is independent of stress, and purely morphologically 
specified. The only way to “save” the stress-based account (and this is what Anderson 
2013 does) is to claim that the augment is inherently limited to “tensed” forms, and is 
therefore not available for the infinitive. He observes, in support of his view, that it does 
not occur, either, in participles or in “related non-verbal forms”. Since the augment also 
appears in second person singular imperatives, it is not clear that “tensed” is quite the 
right term: it might be more accurate to say that the domain of the augment involves 
cells with values for person and number. But now Anderson’s claim must be that the 
augment is selected in those parts of the paradigm specified for person and number 
where stress would otherwise fall on the root, while my account can easily be reformu- 
lated, if we wish, as also applying to those parts of the paradigm specified for person and 


H T must acknowledge here a different, and very careful, analysis of these facts by Kaye (2015: 291-310), 
for whom -aj- is not the 'augment' but part of the stressed lexical root of the verb, whose historically 
regular unstressed counterpart is -j- ("bate djare > betjar; "batedja > *be'taja). In a form such as be'tjaja, on 
Kaye's account, the unstressed element -j- has been analogically generalized into the root of the stressed 
alternant, originally of the type *betaja < *batedja (Kaye 2015: 307); resistance of betjar (< "bate djare) 
to stress retraction is then, Kaye suggests2015: 309, a function of the degree of phonological difference 
between stressed and unstressed root. I would observe that -aj- is exactly the expected reflex of the proto- 
form of the augment (although rarely attested in the rest of Romansh, where is has been supplanted by 
-ef-), and that it is not clear why the root found in the root-stressed present-tense forms of the verb would 
be phonologically disfavoured in root-stressed infinitives. In fact, even if -aj- turns not to be, in origin, 
an “augment”, such an analysis suggests that speakers have effectively analysed betj- as the lexical root, 
treating -aj- as a kind of excrescent element following it, and one that occurs just in the N-pattern cells. 
That is to say that its synchronic status is equivalent to that of the augment in other verbs. 


201 


Martin Maiden 


number, except for the first and second persons plural present indicative. Both accounts 
acknowledge that the phenomenon is heavily morphologized, and applies over a domain 
whose definition corresponds to no natural phonological or morphosyntactic class. Even 
Anderson’s account is, I submit, implicitly “morphomic”. 


4 The generality of the alternations: derivational 
morphology 


Anderson’s analysis gains support from the fact that the vocalic alternations that occur 
within the verb also occur outside it: nouns and adjectives with stressed derivational 
affixes show the corresponding “unstressed” vocalic alternants in the derived forms. The 
sensitivity of these alternations to stress is thereby argued to be a general property of the 
grammar, and not a peculiarity of verb morphology.morphology Take, for example, the 
behaviour of the vocalic alternants in derivational morphology (Table 8), as presented 
by Anderson (2011: 28-30;2013: 13-17): 


Table 8: Vocalic alternants in Savognin derivational morphology 


Verb Basic noun Derived nouns 
Infinitive SSG.PRS.  PST.PART 
guttár'drip ^ gotta gót ‘drop’ gutélla ‘drip’ 
liiér ‘bind’ léia léia ‘union’  liadéira ‘binding’ 
néiver ‘snow’ néiva navia néiv ‘snow’  naváglia ‘big snowfall’ 


In reality, this might be no more than the residual, and synchronically more or less 
accidental, effect of historical differentiation of vowel quality according to stress. This is 
suggested by the fact that there are derived forms with stressed suffixes (and therefore 
with unstressed roots) where, nonetheless, the stressed alternant occurs. Thus Table 9: 


Table 9: Discrepancy between vowel alternation in verbs and derived forms 


Verb Derived forms 
Infinitive 3SG.PRS. 
satgér ‘dry’ sétga 
accumpagnér ‘accompany’ | accumpógna 
durméir ‘sleep’ dórma 
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Similar phenomena recur elsewhere in Romansh, as Anderson (2013: 20) points out: 
thus Vallader has scóula ‘school’, scolar ‘to school’ (3sG.PRS.IND scóula), scolazitin 'edu- 
cation’, yet diminutives scoulina ‘kindergarten’, scoulétta ‘craft school’. Anderson (2011b: 
28) deals with such apparent counterexamples by assuming an architecture of the gram- 
mar in which morphology and phonology “interact cyclically (with some appropriate 
subsystem) of the phonology applying to adjust the results of each stage of morpholog- 
ical elaboration of a form”. The selection of the stressed or unstressed stem alternant 
operates only during the “first cycle”; once the stem-shape is determined ‘the decision 
is not revisited on subsequent cycles’. A “stressed” base will then persist through later 
cycles, even if it is itself no longer stressed: the derivational counterexamples can now 
be explained in terms of the cycle on which they occur. 

One immediate objection is that saying that an apparently phonological phenomenon 
is confined to a particular “cycle” is in fact to concede that it is “morphologized” (the 
cycles being defined, precisely, as stages of “morphological elaboration of a form”), and 
restriction of a phonological rule to some morphologically defined domain is to intro- 
duce into the analysis a considerable degree of arbitrariness (precisely one of the things 
that for Anderson consitutes an objection to the purely *morphomic" analysis). Things 
look even more arbitrary, and “morphological”, if we consider that we now have to say 
that the domain of the phonologically conditioned allomorphy is defined over two quite 
disparate sets of forms: "tensed" forms of the verb (at least in Vallader) and the "first 
cycle" in derivation. A more fundamental difficulty is that it is not always true that the 
"stressed" stem persists unchanged after the "first cycle": why do we have, say, derived 
forms accumpagnedér or durmiglión with “unstressed” alternants, yet accumpognamáint 
or dormulént with “stressed” alternants?? Actually, the predicted selection of the “un- 
stressed" alternant usually occurs in words belonging to inherited vocabulary, but not 
in neologisms, which led me to conclude (Maiden 2011b: 41) that what we have is evi- 
dence of the “death” of phonological selection of the allomorphs, now reflected only in 
traditional vocabulary. This claim tends to be reflected in the behaviour of adjectives 
showing reflexes of Latin -ABILIS, -IBILIS (equivalents of English -able, -ible): the “pop- 
ular” reflex by direct inheritance from Latin (-evel) displays the “unstressed” alternant 
(e.g., ludével ‘praiseworthy’), while the “learned” -ábel/-íbel displays the “stressed” alter- 
nant (e.g., accumodábel 'accommodatable"). One might, perhaps, want to assign -evel to 
the “first cycle”, and -abel to the second, but even this does not work too well, for we 
find occasional examples of the distinctive “unstressed” alternant with -ábel/-íbel: e.g., 
schliibel ‘soluble’, bavábel ‘drinkable’, purtábel ‘portable’, duvrabel ‘usable’. 

Anderson (2013: 15) observes an “asymmetry”, in that the counterexamples to his claim 
all involve the appearance of a “stressed” stem that does not bear stress, while no exam- 
ples exist in which an “unstressed” stem appears under stress. But this is not proof that 
that the stem alternants are sensitive to stress. On such evidence as Anderson presents 
(and from Signorell 2001), the small inventory of possibly derived forms involving a 
stressed stem displays that stem simply because it is the phonologically regular result 
of their etyma (e.g., prescháint ‘present’ (adj.) < PRAESENTEM). In any case, it is perfectly 


12 For an inconclusive discussion of these data, see Wolf (2013: 171). 
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possible that some forms such as prescháint are not “derived”, but are the base forms 
from which the corresponding verbs are derived. Anderson also observes 2013: 18 that 
in Surselvan (in fact, more widely) only “unstressed” root alternants appear in factive 
verbs formed with the suffix -ent- or -ant-,? where the lexical root is systematically un- 
stressed (e.g., INF béiber ‘drink’, 1PL.PRS buéin, but factive INF buentár ‘cause to drink’, 
never ““beibentar). He cites similar phenomena in Puter (Anderson 2013: 10), such as 
stanglantér from stáungel ‘tired’, but says that "[m]ore research is needed to establish 
the generality of the phenomenon". Here I concur: we need to be sure that selection of 
the "unstressed" stem is synchronically productive and therefore psychologically real. 
Otherwise, all we may have is the regular, lexicalized, outcome of old sound changes in 
the relevant derived forms. In any case, there do seem to be examples of factive verbs in 
-entar that bear the “stressed” root allomorph. Jaberg (1939: 291f.) lists Surselvan exam- 
ples most of which indeed bear the “unstressed” allomorph, but also gives dormentár ‘put 
to sleep’ (cf. nF durmir ‘sleep’, 1sc.PRs.IND dorm), and a case in which the unstressed 
vowel of the derived form, while phonologically plausible, does not correspond to that of 
the base verb (scumpentár ‘cause to be saved, heal’, from INF scampár ‘save’, 3SG.PRS.IND 
scómpa). 

Even leaving counterexamples aside (and I acknowledge that they are not many), it 
is not obviously necessary to invoke stress: given that the vast majority of cells of the 
inflexional paradigm of any Romansh verb are arrhizotonic, one might equally say that 
what we call the *unstressed" stem is the default, on which affixally derived forms are 
usually built. An apparent counterargument to such an approach might come (cf. Ander- 
son 2013: 17) from the fact that, in cases of derivation where the stress falls on the root, 
it is always the "stressed" allomorph that appears: e.g., Surmiran cumónd ‘order’ (cf. INF 
cumandár, 1sc.PRS.IND cumónd), clóm ‘call’ (cf. INF clamár, 1sG.PRS.IND clóm), gartétg 'suc- 
cess’ (cf. INF gartagér, 1SG.PRS.IND gartétg), dórma ‘narcotic’ (cf. INF durmir, 3sG.PRS.IND 
dórma), stéma ‘esteem’ (cf. INF stimár, 1SG.PRS.IND stéma). Significantly, however, Sig- 
norell, Wuethrich-Grisch & Simeon (1987: 51) describe such forms, pre-theoretically, as 
“deriving from a finite form”: what we may have here is simply nominalization of first or 
third person singular verb-forms (which happen to contain stressed roots), not a deriva- 
tional process which specifies a stressed root, and thereby must select the “stressed” 
alternant. In short, while it is indeed true that many patterns of root-alternation found 
in the verb recur across the grammar, this is largely a historical residue, not necessarily 
evidence of an active synchronic phonological principle. 


5 Conclusion 


Anderson (2013: 23) accepts that “morphological categories play a role (e.g. in constrain- 
ing the appearance of -esch to tensed forms of first and fourth conjugation verbs)”, but 
asserts that ‘there is no warrant for invoking the further step of complete and arbitrary 
morphological categorization that would be implied by associating the variation with 


13 Cf. Signorell, Wuethrich-Grisch & Simeon (1987: 103f.). 
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a morphome’. I suggest that the data are in fact already inextricably permeated with 
“arbitrary” morphological specifications: in Savognin, and in Romansh at large, the mor- 
phomic N-pattern is really present. The need to specify that the alleged phonological 
principle only applies to “tensed” verb forms (for Val Miistair), or to certain levels of 
derivational morphology makes that principle itself “arbitrary”. Given that the behaviour 
of suppletion in dueir and the distribution of -esch are, as I have argued, incompatible 
with the “phonological” account, attempts to account for the identically distributed vo- 
calic alternations in phonological terms become superfluous. Finally, given that count- 
less Romance varieties do have genuinely morphomic patterns of the kind attested in 
Savognin, treatment of Savognin as a special case is what may be “unwarranted”. And 
yet.... 

I do not think that Savognin can be presented as the perfect example of “phonolog- 
ically conditioned allomorphy” that Anderson claims, and yet one must ask whether 
Anderson's insight - that right across the grammar there is a close correlation between 
stress and the selection of alternants, might be at risk of being abandoned too lightly. My 
criticism has been that there exist some cases where such an analysis does not “work”, 
and that since we need independently to invoke the N-pattern even for Savognin, we 
should do so for all types of alternation which follow that pattern. But there is an un- 
spoken assumption here which may need to be challenged, and it involves what might 
be described as the “ghettoization of the morphomic". The classic examples of “mor- 
phomic" phenomena as adduced by Aronoff (1994) make the case for the existence of 
a “morphomic level" of linguistic analysis precisely because they are not plausibly ex- 
plicable as effects of phonological, syntactic, or semantic conditioning: they are cases 
of “morphology by itself”. In morphologists’ enthusiasm to assert the existence of gen- 
uinely morphological phenomena, much weight has been placed on the notion of the 
"autonomy" of morphology (witness the titles of Maiden 2005, or Maiden et al. 2011). 
While there are very good reasons to proclaim loudly that “autonomously morpholog- 
ical" phenomena exist, the search for them should not become a reductivist obsession, 
nor is there any good reason to suppose that there cannot exist phenomena which con- 
tain a very high degree of purely morphological determination, while yet also possessing 
some degree of phonological or other conditioning. 

The seeds of a possible compromise appear in Maiden (2013) (actually, in the same 
volume as, and immediately following, Anderson 2013). This deals with patterns of con- 
sonantal alternation in Italian verbs historically caused by two different kinds of palatal- 
ization. Synchronically, the result is that phonologically quite disparate types of alter- 
nant tend overwhelmingly to conform to a common distribution such that one alternant 
occurs in all (or most) forms of the present subjunctive, and in the first person singular 
and third person plural forms of the present indicative, but nowhere else in the para- 
digm. There are powerful arguments (see, e.g., Maiden 1992,20112011: 205-63) to say that 
this pattern has lost all phonological causation and is genuinely morphomic. In Maiden 
(2009) I had been extremely critical of attempts by Burzio (2004) to force a synchronic 
phonological analysis of the modern Italian facts, quite often by what is, in effect, the 
illegitimate resurrection of long dead phonological conditioning environments. For the 
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most part, these are criticisms I stand by: it cannot be said too often that morphology 
suffers from a kind of “phonologizing bias” which too readily dismisses morphological 
analyses of the data, and is far too prone to give credence to phonologically-oriented ac- 
counts, even at the expense of postulating conditioning environments lacking plausible 
synchronic justification. Burzio’s analysis appeared to me an example of this kind, but 
observation of some of the fine details of the diachrony of the alternations at issue later 
caused me to moderate my view. 

One type of alternation involved an opposition between velar consonants and palatal 
affricates, the latter arising, historically, through palatalization and affrication of velars 
before front vowels. Now it is beyond reasonable doubt that there has existed no produc- 
tive process of palatalization/affrication of velars before front vowels for over a millen- 
nium, such a putative process being massively counterexemplified by the existence of 
unmodified velars before front vowels from the time of the earliest documents, includ- 
ing within the paradigm of the verbs at issue. The principal fact!^ which made me revise 
(Maiden 2013) the “morphological exclusivism” of my earlier treatments of the subject, 
however, was the observation that in medieval Italian a certain type of analogical innova- 
tion affecting verbs displaying the relevant types of alternation, whereby the root of the 
present subjunctive was optionally extended into gerund forms with the ending -endo, 
was strikingly, and systematically, blocked just where the result would have been a velar 
consonant followed by a front vowel. Informally: “don’t allow a velar alternant before 
a front vowel if an alternative (and more phonologically “natural”) palatal alternant is 
also available". Thus Table 10 (where “**” means “not occurring"): 


Table 10: Analogically reformed subjunctives in old Tuscan 


subjunctive inherited gerund gerund analogically reformed on subjunctive 
possa ‘can’ potendo possendo 

ve/dd;/a ‘see’ vedendo ve/ ddz/endo 

te/np/a ‘hold’ tenendo te/np/endo 

abbia ‘have’ avendo abbiendo 

pia/ti/a ‘please’ ` pia/tf/endo pia/tt[/endo 

di/k/a 'say' di/t(/endo **di/k/endo 

pian/g/a ‘weep’ ` pian/dj/endo **pian/g/endo 


While it would have been impossible to explain the distribution of the alternants in 
purely phonological terms (for the reasons, see Maiden 2013: 25-31), this behaviour 
clearly suggests residual sensitivity to phonologically plausible environments for the dis- 
tribution of certain alternants. 

In short, while Anderson's analysis of the Savognin data seems to me too “phonol- 
ogizing”, it may be that my own approach, insisting on purely morphological aspects 


14 But see also Maiden (2013: 31-35). 
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of the phenomenon, has been too “morphologizing”, and both approaches seem to be 
subject to the questionable assumption that redundancy must be eliminated from the 
analysis. I do not think that Savognin is as “pure” an example of “phonologically condi- 
tioned allomorphy” as Anderson believes, but the possibility that speakers are sensitive 
to the recurrent correlation between certain types of alternation and stress should not 
be sacrificed too hastily on the altar of formal economy. As Sims (2015: 205f.) observes, 
our two approaches need not in fact be mutually exclusive. We have probably reached 
the point where only appropriately devised psycholinguistic experimentation would be 
able to tell us more about the Savognin data. However that may be, morphologists and 
Romance linguists are truly in Steve Anderson’s debt for having focused our attention 
so sharply on these fascinating data. 


Abbreviations 


AIS Jaberg & Jud (1964) 
ALDI Goebl et al. (1998) 
ALDII Goebl et al. (2012) 
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How to wake up irregular (and 
speechless) 


Charles Yang 


University of Pennsylvania 


I suggest that morphological defectiveness arises when the learner fails to discover a pro- 
ductive/default process in a morphological category. The detection of productivity, or lack 
thereof, can be accomplished by the Tolerance Principle, a simple mathematical model of 
language learning and generalization. In this paper, I show that the absence of “amn’t, the 
negative contracted form of am, in most English dialects can be predicted on purely numer- 
ical basis. Implications for language acquisition, variation, and change are also discussed. 


1 From Irregular Verbs to Productivity 


In my first linguistics talk, which was also my job interview at Yale, I proposed that 
English irregular past tense is not learned by forming associations between the stem and 
the inflected form, contrary to the dominant view in the psychological study of language 
(Rumelhart & McClelland 1986; Pinker 1999). Rather, irregular past tense is generated 
by morpholexical rules. These rules do not generalize beyond a fixed list but are rules 
nevertheless, in the sense that they take the stem (e.g., think) as the input and generate 
an output (e.g., thought), the inflection, via a computational process of structural change 
(e.g., “Rime — /ot/”). I was approaching the problem as a computer scientist: rules 
are most naturally realized as a list of if-then statements, for regulars and irregulars 
alike, which turns out to be the approach taken throughout the history of linguistics 
(Bloch 1947; Chomsky & Halle 1968; Halle & Marantz 1993) including Steve's own work 
(1973; 1992). There is in fact developmental evidence for the rule-based approach when 
I reanalyzed the past tense acquisition data purportedly confirming the associationist 
account (Yang 2002b). 

I supposed Steve was at least somewhat persuaded by the argument; a few months 
later I got the job. But he did wonder aloud after the talk, with a quizzical frown-cum- 
smile that only he can manage: “But how does a rule wake up in the morning and decide 
to be irregular?" 


Charles Yang. 2017. How to wake up irregular (and speechless). In Claire Bowern, 
| Laurence Horn & Raffaella Zanuttini (eds.), On ooking into words (and beyond), 211- 
233. Berlin: Language Science Press. ) 


Charles Yang 


Indeed. Since words do not wear tags of (ir)regularity, any morphological theory that 
recognizes regularity and irregularity, which is pretty much everything on the market, 
must say something about how a rule or process wakes up to be irregular. In fact, the- 
ories that reject such a categorical distinction (eg. Hay & Baayen 2003; McClelland 
& Patterson 2002) ought to be off market. Children’s morphological productivity is 
strongly discrete; see Yang 2016: Chapter 2 for a cross-linguistic review. Their errors 
are almost exclusively over-regularizations of productive rules. This is quite well known 
thanks to the past tense debate: for example, the past tense of hold sometimes surfaces 
as holded, with the “-d” rule (Marcus et al. 1992). What is not widely known and even 
less appreciated is the near total absence of over-irregularization errors, despite frequent 
anecdotes to the contrary (e.g., bite-bote, wipe-wope, think-thunk, etc.; Bowerman 1982; 
Bybee 1985; Pinker 1999). These errors are sufficiently rare, occurring in about 0.2% of 
English-learning children’s past tense use, that Xu & Pinker (1995) dub them “weird past 
tense errors”. Not a single instance of bote, wope, thunk, or many conceivable analog- 
ical patterns can be found in the millions of child English words in the public domain 
(MacWhinney 2000). The distinction between regular and irregular rules was in fact 
observed in Berko’s (1958) original Wug test. While children were quite happy to add 
“-d” to novel verbs such as rick and spow, only one out of eighty six subjects irregular- 
ized bing and gling, although adults are often prone to form irregular analogies in an 
experimental setting.’ 

So Steve’s question sent me on a long quest. To maintain that both regulars and irreg- 
ulars are computed by rules, I needed a story of how children separate out productive 
and unproductive rules so precisely and effortlessly. Although a solution was worked 
out shortly after (Yang 2002a), it took me many years to fully recognize the scope of the 
productivity problem - one of the “central mysteries” in morphology (Aronoff 1976: 35) 
- and the challenges it poses. 

At a first glance, it doesn't seem difficult to give an answer for English past tense. The 
rule “add -d” covers most verb types in the language and can thus be deemed regular, as 
"statistical predominance" has always been regarded as the hallmark of the default (e.g., 
Nida 1949: 14). But this is surely too simplistic when crosslinguistic and psychological 
factors are taken into account. More concretely, at least four empirical problems, each of 
which is illustrated with a familiar example in (1), fall under the productivity problem. 


(1) a. English past tense: That a default rule is learned abruptly and results in over- 
regularization, after a protracted stage of rote memorization (Marcus et al. 

1992; Yang 2002b). 
b. English stress: That the grammar of English stress (Chomsky & Halle 1968; 
Hayes 1982; Halle & Vergnaud 1987) is not trochaic with a list of lexical excep- 


tions despite an overwhelming majority of English words bearing stress on 
the first syllable (Cutler & Carter 1987; Legate & Yang 2013). 


! This suggests that the Wug test and similar methods such as rating have task-specific complications and 
should not be taken as a direct reflection of an individual's morphological knowledge; see Schütze 2005 
and Yang 2016 for discussion. 
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c. German noun plurals: That a suffix (“-s”) can be the productive default despite 
coverage of fewer nouns than any of its four competitors (Clahsen et al. 1992; 
Wiese 1996). 


d. Russian gaps: That morphological categories needn’t and sometimes do not 
have a default, as illustrated by the missing inflections of certain Russian verbs 
in the 1st person singular non-past (Halle 1973). 


In Yang (2016), I propose a model of productivity, the Tolerance Principle, which pro- 
vides a unified solution for the problems in (1), as well as similar problems that involve 
inductive learning in phonology, syntax, and language change. In this paper, I revisit 
Steve’s question which, in a significant way, drove this project forward. My focus is on 
a topic that has featured prominently in Steve’s recent research: morphological gaps and 
the nature of defectiveness in word formation (e.g., Anderson 2008; 2010b). 


2 The Tolerance Principle 


The development of the Tolerance Principle started as a purely formal conjecture: How 
would one represent a rule (R) and the exceptions of that rule (e.g., a set of words wi, w2, 
.. Wn)? If one is committed to a mechanistic account of the matter - like a computer 
programmer, for instance - perhaps the only way to encode rules and exceptions is 
through a set of conditional statements: 


(2 Ifw= w Then... 


If w = wa Then... 
If w = we Then... 
Apply R 


This of course immediately recalls the Elsewhere Condition, ever present in linguistics 
since Panini (Anderson 1969; Aronoff 1976; Kiparsky 1973; Halle & Marantz 1993). In 
particular, the data structure in (2) entails that in order for a (productive) rule to apply 
to a word, the system must scan through a list to ensure that it is not one of the exceptions 
(wi, wa, ..., We). 

There is something perverse about (2). For example, to produce walked, one must 
scan through the irregular verbs to make sure that walk is not found on the list. But a 
moment of reflection suggests that the Elsewhere Condition makes perfect sense. The 
alternative to listing the irregulars would have to be listing the regulars. One can imag- 
ine assigning each regular verb a flag, which immediately triggers the application of 
the “add -d" rule. But that would imply that the morphological status of every word 
must be committed to special memory; the irregulars as well, since they are by defini- 
tion unpredictable. Perhaps even more surprisingly, there is broad behavioral support 
for the irregulars-first-regulars-later representation of rules; see Yang 2016: Chapter 3 
for review. The psycholinguistic evidence comes from real-time processing of words 
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and morphology. When irregulars and regulars are suitably matched for various fac- 
tors (e.g., stem and surface frequency) that affect the speed of processing, irregulars are 
recognized and produced significantly faster than regulars — which is consistent with 
the algorithmic interpretation of the Elsewhere Condition as a computational model of 
language processing. 

From (2), then, we can develop an empirically motivated cost-benefit calculus for the 
price of exceptions. Specifically, words that fall under a productive rule must “wait” 
for the exceptions to be processed first: the more exceptions there are, the longer the 
rule will have to wait. Under very general assumptions about word frequencies, we can 
prove: 


(3) Tolerance Principle 
Suppose a rule R is applicable to N items in a learner's vocabulary, of which e are 
exceptions that do not follow R. The sufficient and necessary condition for the 
productivity of R is: 


N 
InN 


The Tolerance Principle requires two input values, N and e, and returns the productivity 
status of a rule. Its application requires a well-defined rule such that N and e can be 
measured, by the child learner during language acquisition and by the researcher when 
studying linguistic productivity. To learn the structural description of a rule, typically in 
the form of “X — Y”, one will need to invoke inductive learning models such as those 
studied in artificial intelligence, cognitive science, and indeed linguistics (e.g., Chomsky 
1955). Almost all inductive models form generalizations over specific learning instances 
and try to discover the shared characteristics of individual elements associated with a 
shared pattern. For example, suppose two good baseball hitters can be described with 
feature bundles [+red cap, +black shirt, +long socks] and [+red cap, +black shirt, +short 
socks]. The rule “[+red cap, «black shirt] —9 good hitter” will follow, as the shared 
features (cap, shirt) are retained and the conflicting feature (sock) is neutralized. Obvi- 
ously, the application of inductive learning must encode the structural constraints on 
the human language faculty and other cognitive systems implicated in language acqui- 
sition (Chomsky 1965). While it is clear that the properties of human language are far 
from arbitrary, it remains an open question to what extent they reflect a unique system 
of Universal Grammar (e.g., Merge; Berwick & Chomsky 2016) or general principles of 
cognition and learning that show continuities with other domains and species; see Yang 
2004; Chomsky 2005; Yang et al. 2017 for general discussions. 

Table 1 provides some sample values of N and the associate threshold value Oy. 

The apparently, and perhaps surprisingly, low threshold has interesting implications 
for language acquisition. Most importantly, it suggests that all things being equal, smaller 
vocabulary (smaller values of N) can tolerate relatively more exceptions. That is, pro- 
ductive rules are more detectable when learners have less experience with a language, 
especially when they have a small lexicon that only consists of relatively high frequency 
words. This may explain children's remarkably early command of the main ingredients 


e € On where Oy := 
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Table 1: The tolerance threshold for rules of varying sizes 


N On % 


10 4 40.0 
20 7 35.0 
50 13 26.0 
100 22 22.0 
200 38 19.0 
500 80 16.0 
1000 145 145 
5,000 587 11.7 


of language (Yang 2013), as well as the reason why maturational constraints may aid 
rather than hamper language acquisition (Newport 1990); see Yang 2016: Chapter 7 for 
extensive discussion. 

The Tolerance Principle has proved highly effective. In Yang (2016), it was applied 
almost 100 times, making accurate productivity predictions across many languages and 
domains using only corpus statistics. Furthermore, experimental studies in collabora- 
tion with Kathryn Schuler and Elissa Newport have found near categorical confirmation 
for the Tolerance Principle in artificial language studies with young children (Schuler, 
Yang & Newport 2016). Some of these robust results are unexpected. This is because the 
derivation in (3) makes use of numerical approximations that only hold when N is large. 
In the empirical case studies, however, the value of N is often very modest (e.g., 8 or 
9 in the artificial language studies) as it refers to the number distinct lexical items in a 
morphological category. For the moment, I put these questions aside and return to the 
problems in (1): the low threshold of exceptions provides just the right approach to the 
productivity problem across languages. 

Consider first the acquisition of English past tense. Through an inductive process il- 
lustrated earlier, the phonological diversity of the regulars will quickly establish that 
any verb can take the “-d” suffix. Its productivity will be determined by the total number 
of verbs (N) and the irregulars (e) in the learner’s vocabulary. The same consideration 
applies to the irregular rules. For instance, the seven irregular verbs bring, buy, catch, 
fight, seek, teach, and think all follow the stem change “ought”. Such a mixed bag of 
phonological shapes will also yield an all-inclusive rule, as shown by computational im- 
plementations (Yip & Sussman 1998). But the “ought” rule will fare terribly. It only works 
for seven items, with hundreds and thousands of exceptions, far exceeding the tolerance 
threshold. As a result, the rule will be relegated to lexicalization. Other irregular pat- 
terns can be analyzed similarly: as I show elsewhere (Yang 2016: Chapter 4), they all 
wake up nonproductive in the morning, thereby accounting for the near total absence 
of over-irregularization errors (Xu & Pinker 1995). 
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Following the same logic, we can see that the emergence of the “-d” rule will require a 
long period of gestation. Although children can quickly induce its structural description 
- using no more than a few dozen verbs (again Yip & Sussman 1998) - their early verbs 
will contain many irregulars. Of the top 200 verbs inflected in the past tense (MacWhin- 
ney 2000), 76 are irregulars. Because Gan is only 37, it follows that children who know 
some 200 most frequent verbs cannot recognize the productivity of “-d” despite its “sta- 
tistical predominance”. During this period of time, even though verbs may be produced 
with the “-d” suffix, they are in effect irregular: the suffix has no productivity and does 
not extend beyond a fixed list rote-learned from the input. The telltale evidence for pro- 
ductivity comes from the first attested overregularization errors (Marcus et al. 1992). For 
individual learners with reasonably complete records of language development, the Tol- 
erance Principle can help us understand why the regular rule becomes productive at 
that exact moment it did. For example, "Adam", the poster child of English past tense 
research (Pinker 1999), produced his first over-regularization error at 2;11: “What dat 
feeled like?” In the transcript of almost a year prior to that point, not a single irregular 
verb past tense was used incorrectly. It must be that by 2;11, Adam had acquired a suf- 
ficiently large number of regulars to overwhelm the irregulars. To test this prediction, I 
extracted every verb stem in Adam’s speech until 2;11. There are N = 300 verbs in all, 
out of which e = 57 are irregulars. This is very close to the predicted 0599 = 53, and 
the small discrepancy may be due to the under-sampling of the regulars, which tend 
to be less frequent and thus more likely missing from the corpus. The critical point to 
note here is that Adam apparently needed a filibuster-proof majority of regular verbs 
to acquire the “-d” rule: this is strongly consistent with the predictions of the Tolerance 
Principle as illustrated in Table 1. 

The problems of English stress and German plurals in (1) are similar. In the English 
case, the assignment of stress to the first syllable may be transiently productive when 
the child has a very small vocabulary (Kehoe & Stoel-Gammon 1997; Legate & Yang 2013). 
But it will fail to clear the tolerance threshold when the vocabulary reaches a modest 
size: even 85% of coverage is not sufficient for larger values of N (e.g., 5000; Table 1). In 
the German case, none of the five plural suffixes can tolerate the other four as excep- 
tions, not least the “-s” suffix, which covers the smallest set. In both cases, the learner 
will carry out recursive applications of the Tolerance Principle. When no rule emerges 
as productive over the totality of a lexical set, the learner will subdivide it along some 
linguistic dimension, presumably making use of constraints on language and other cog- 
nitive systems, and attempt to discover productive rules within. Such a move, while more 
complex, is always more likely to yield productive rules: again, smaller N’s that result 
from subdividing the lexicon tolerate a relatively higher proportion of exceptions than 
larger N’s. For the acquisition of stress, dividing words into nouns and verbs and taking 
the syllabic weight into account, as prescribed by all modern metrical theories, lead to 
productive rules of stress assignment, an outcome that accords well with both structural 
and behavioral findings (Ladefoged & Fromkin 1968; Baker & Smith 1976; Kelly 1992; 
Guion et al. 2003). The study by Legate & Yang (2013) also reveals important differences 
between theories of stress in their statistical coverage of the English lexicon: while all 
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theories handle a great majority of English words, only the theory of Halle 1998 clears 
the tolerance threshold of exceptions. For the acquisition of German plurals, the move is 
to subdivide the nouns by grammatical gender as well as phonological conditions, sim- 
ilar to certain theoretical approaches to German morphology (e.g., Wiese 1996). The -s 
suffix indeed survives as the default because the other suffixes are productive with more 
restrictive domains of nouns. 

The emergence of morphological gaps is a logical outcome of the Tolerance Principle, 
which constitutes the topic of the present study. When a rule wakes up irregular, the 
learner must learn, from positive evidence, the inflected form for each word. Failing to 
hear a particular inflected form will render the speaker speechless when that form is 
needed. 


3 Why Am+Not + Amn't? 


3.1 Conditions on Gaps 


Many current theories of morphology, including Distributed Morphology (for which 
see Halle & Marantz 1993), Optimality Theory (Prince & Smolensky 2004), Dual-Route 
Morphology (Pinker 1999; Clahsen 1999), Network Morphology (Brown & Hippisley 
2012), Paradigm Function Morphology (Stump 2001) and others, invoke the notion of 
competition, which by design results in a default or winning form (at least in the in- 
flectional domain). This architectural feature of the theories is inherently incompatible 
with the existence of morphological gaps, which are quite widespread across languages 
(Baerman, Corbett & Brown 2010). The Tolerance based approach, while also competi- 
tion based (through the Elsewhere Condition), does not stipulate a default or productive 
rule as a primitive in the theoretical machinery. Rather, the presence or absence of a 
productive rule is the outcome of language acquisition, to be determined by children 
through the composition of the linguistic data. More specifically, the Tolerance Princi- 
ple provides the following corollary (Yang 2016: 142): 


(4) Conditions on gaps 
Consider a morphological category C with S alternations, each affecting N; lexical 
items (1 € i € S), and >); N; = N. Gaps arise in C only if: 


Vi,1<i< S, 5 Nj > On 
Tei 
That is, none of the alternations (S;) in N are sufficiently numerous to tolerate all the rest 
La Nj) as exceptions: no productive alternation will be identified. The speaker must 
hear the morphological realization of every word in C; if any is to slip through the cracks, 
a defective gap appears. I should note that in the conception and application of the Tol- 
erance Principle, the terms such as “category” and “alternation” are meant to be general 
and not restricted to morphology per se. For instance, “category” can be interpreted as 
any well-defined structural class with a finite number of elements (phonemes, words, 
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morphosyntactic structures, the directionality of a finite number of functional heads, 
etc.), and “alternation” can be understood as any outcome of a computational process 
defined over such a class. The Tolerance Principle asserts that in order for a productive 
pattern to emerge, one of the alternations must be statistically dominant. Elsewhere I 
have studied several well-known gaps in English, Polish, Spanish, and Russian (Yang 
2016: Chapter 5). Their presence is predictable entirely on numerical ground, requiring 
nothing more than tallying up the counts of the lexical items subject to each alterna- 
tion. In what follows, I provide a Tolerance Principle account of another much-studied 
instance of a defective paradigm. 


3.2 The Statistics of N’t Gaps 


In many dialects of English, n’t is not permitted to contract onto auxiliary verbs such 
as am and may, as seen in the unavailability of, for example, “I amn't tired" and ““You 
mayn't do that” (e.g., Anderwald 2003a; Bresnan 2001; Broadbent 2009; Frampton 2001; 
Hudson 2000; Zwicky & Pullum 1983). Following Zwicky & Pullum (1983), I will assume 
that the contracted negative n’t is an inflectional affix. The question is why n’t cannot 
attach to all auxiliary verbs residing in the Tense node. From the perspective of the 
Tolerance Principle, the emergence of gaps must result from a critical mass of exceptions 
to the contraction process. 

Let us consider the behavior of the auxiliary hosts for n’t. Zwicky & Pullum (1983: 
p508) provide a near comprehensive list, which I revise with some additional information 
in Table 2. 

Table 2 provides the frequencies of the auxiliary verbs and their negation in both 
uncontracted and contracted forms in the 520-million-word Corpus of Contemporary 
American English (COCA; Davies 2008). Given the heterogeneity of the textual sources, 
a handful tokens of amn’t and mayn’t can be found albeit at very low frequencies. The 
n't-contracted forms of shall and dare — shan't and daren’t — are also impossible for most 
American English speakers but are included here for completeness. Although shan’t is 
often perceived as a stereotypically British English feature, it seems to be vanishing 
across the pond as well. In a 6.6-million-word corpus of British English (MacWhinney 
2000), not a single instance of shan't is found. And its frequency of usage has been in a 
steady decline since 1800, the beginning date of the Google Books Corpus. As of daren't, 
the OED does not provide any citation and it has always been very rare throughout the 
period of the Google Books Corpus. These gapped forms are marked by Ø. 

The prescriptively maligned ain't ([emt]), however, is robustly attested for am, are, is, 
have, and has in COCA as well as a six-million-word corpus of child-directed American 
English (MacWhinney 2000). Since the phonological form of [emt] is unpredictable from 
the auxiliary host, it is boldfaced in Table 2 to mark its idiosyncrasy, along with a few 
other exceptions to which I return later. Note that the frequency estimates of the ain’t 
forms are approximate. First, I only counted strings where ain’t is immediately preceded 
by a pronoun - the majority case, but sentences with a lexical subject (e.g., “Kids ain't 
ready”) are not included. Second, because both be and have can take on ain’t, the counts 
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Table 2: The morphophonological alternation of n’t contraction 


aux+not n't contraction (96) 
could [kud] 45,256 ` [kudnt] 106,123 ^ 70104 
did [did] 128,432 [didnt] 342,202 — 7271 
does [daz] 72,494 [daznt] 164,922 69.553 
had [hed] 27410  [haednt] 46,987 63.157 
has [haez] 28,529  [hseznt] 29,578 50.255 
has [haez] 28,529 [ernt] 749 1.273 
have [hzev] 24,957 [heevnt] 45,849 63.868 
have [hzv] 24,957 [ernt] 981 1.367 
is [1z] 189,538  [iznt] 100,164 34275 
is [rz] 189,38 [emt] 2,537 0.868 
might [mart] 14,80  [martnt] 78 0.525 
must [mast] 4156  [masnt] 917 18.076 
need [nid] 3,705  [nidnt] 1235 25.000 
ought [ot] 1031 [otnt] 66 6.016 
should [fud] 20,77  [fudnt] 25,576 55.416 
was [waz] 97,457  [waznt] 141,384 59.196 
would [wud] 46,205 [wudnt] 85,853 65.012 
am [aem] 10258 Ø 5 0.041 
am [aem] 10,258 [ernt] 2,046 16.622 
are [ar] 89,083  [arnt] 50,137 35.602 
are [ar] 89,083  [ernt] 1,073 0.765 
can [ken] 75,5331  [kznt] 201,060 72.692 
dare [deor] 320 Ø 25 7.246 
do [du] 81,074 [dont] 654,576 88.979 
may [mer] 363195 Ø 12 0.033 
shall [fæl] 1,271 Ø 123 8.824 
were [wr] 41,224  [wrnt] 35,120 46.002 
will [wil] 39,068 [wont] 86,158 68.802 


for the auxiliaries are parceled out by extrapolating from the frequencies of the regularly 
contracted n't forms? For instance, there are 2,054 instances of “you/they ain't": the 
"share" for are is based on the count of "aren't" (50,137) relative to "haven't" (45,849). 


2 Here I gloss over the fact that there are English dialects in which ain't is also an alternative form of negative 
contraction for do, does, and did (e.g., Labov et al. 1968; Weldon 1994). It would be difficult to estimate their 
frequencies but formally, this use of ain't serves to create additional (unpredictable) exceptions to the 
contraction process which, as we discuss below, contributes to the breakdown of productivity and the 
emergence of gaps. 
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This amounts to 52.2% of 2,054, or 1,073, as recorded in the Table. Finally, the estimate of 
ain’t as the contraction of am + n’t cannot follow a similar process because, of course, 
amn't is gapped. I thus allocated roughly 75% of the “I ain't" counts, which is the share 
of “I am not” out of the total of “I am not” and “I have not”, to the contraction of am not. 
For these five auxiliaries that can be realized as ain’t, the percentage of the contracted 
forms are based on the sum of uncontracted, n’t-contracted, and ain’t-contracted forms. 
More precise estimates are certainly possible but as we will see, the exact frequencies 
are not especially important for our purposes: it is more pertinent to approximate a 
“typical” English speaker’s experience with these forms. Roughly, we would like to know 
whether an English speaker will have encountered a specific phonological word at all, by 
using some independently motivated frequency threshold (e.g., once per million; Nagy 
& Anderson 1984): it is evident that the frequency of ain’t is sufficiently high for this 
threshold despite our rough estimates. 

A tempting approach to gaps is to appeal to indirect negative evidence (Chomsky 
1981; Pinker 1989). A strong version takes the shape of lexical conservatism: do not use a 
form unless it is explicitly attested. This recalls Halle’s [-Lexical Insertion] treatment of 
gaps in his classic paper (1973) and can be found in recent works as well (e.g., Pertsova 
2005; Steriade 1997; Rice 2005; Wolf & McCarthy 2009). A weak version makes use of 
frequency information. For instance, if amn’t were possible, language learners would 
have surely heard it in the input, especially since am is highly frequent and would have 
had plenty of opportunities to undergo n’t contraction. Its conspicuous absence, then, 
would provide evidence for its ungrammaticality (e.g., Daland, Sims & Pierrehumbert 
2007; Sims 2006; Baerman 2008; Albright 2009). 

Traditional acquisition research has always viewed indirect negative evidence with 
strong suspicion (Berwick 1985; Osherson, Stob & Weinstein 1986; Pinker 1989). Research 
on the amn't gap (e.g. Hudson 2000) has also questioned its usefulness. However, with 
the recent rise of probabilistic approaches to language acquisition especially Bayesian 
models of inference, the field has seen a revival of indirect negative evidence. If the 
conception of learning is a zero-sum - or more precisely, one-sum — game which assigns 
a probabilistic distribution over all linguistic forms, the unattested will necessarily lose 
out to the attested, at least in most probabilistic models of language learning. A thorough 
assessment of indirect negative evidence within a probabilistic framework is beyond the 
scope of the present paper; see Niyogi 2006; Yang 2015; Yang et al. 2017. But a careful 
statistical examination of gaps serves to reveal its deficiencies. Note that the question is 
not whether indirect negative evidence can account for some missing forms: the absence 
of amn’t is indeed unexpected under any reasonable formulation. The real challenge is to 
ensure that indirect negative evidence will pick out only the gapped forms but nothing 
else, while keeping in mind that morphological inflection is generally not gapped but 
fully productive, readily extending to novel items. 

Two observations can be made about the frequency statistics in Table 2, which sug- 
gest that indirect negative evidence is unlikely to succeed. First the n’t forms of several 
auxiliaries such as might and need are in fact quite rare. They appear considerably less 
frequently than once per million, which is generally regarded as the minimum threshold 
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to guarantee exposure for most English speakers (Nagy & Anderson 1984). In the six- 
million-word corpus of child-directed American English (MacWhinney 2000), mightn’t 
appears only once, needn’t appears only twice, and mustn’t does not appear at all. In 
the other words, these n’t forms may be so rare that they are in effect absent for many 
children (Hart & Risley 1995). Lexical conservatism thus will not distinguish them from 
the truly gapped amn't, mayn’t, daren’t, and shan’t, the last of which is in fact more fre- 
quently attested in COCA. Second, consider a statistical interpretation of indirect nega- 
tive evidence. The last column of Table 2 provides the percentage of the n’t contraction 
out of all negated forms. An auxiliary with an unusually low ratio may mean that it has 
performed below expectation and could be a clue for its defectiveness. However, the 
statistics in Table 1 suggest otherwise. It is true that amn’t and mayn’t have very low 
ratios: this fact alone is not remarkable because these are indeed gaps. But exactly how 
low should a ratio be for the learner to regard a contracted form to be defective? On the 
one hand, we have mightn’t and oughtn’t at 0.525% and 6.016%, and these are not defec- 
tive. On the other hand, we have daren’t and shan’t at 7.246% and 8.824%, but these in 
fact are defective. There doesn’t appear to be a threshold of frequency or probability that 
can unambiguously distinguish gapped from ungapped items. 


3.3 N’t Contraction in Language Development and Change 


Let’s see how the Tolerance Principle provides an account of the amn’t gaps. The simplest 
approach is to consider all the auxiliary verbs and their n’t contractions collectively as a 
homogeneous set. Using the once-per-million threshold as a reasonable approximation 
of a typical American English speaker’s vocabulary, and taking the size of the Corpus 
of Contemporary American English (520 million words) into account, there are 18 auxil- 
iaries with reliably attested n’t forms. The four gapped forms are all below this threshold 
and are thus excluded from consideration. It is important to clarify that, unlike various 
forms of lexical conservatism and indirect negative evidence discussed earlier, we do 
not regard the absence of these forms as evidence for their defectiveness. Rather, the 
learner's task is to deduce, on the basis of the 18 well-attested forms, including am~ain’t, 
that n’t contraction is not a productive pattern in the English auxiliary system. 
This is quite easily accomplished. Of the 18 auxiliaries, n’t is realized as follows: 


(5) [nt]: could, did, does, had, need, should, was, would (8) 


[emt] in variation with either [nt] or [nt]: have, has, is, are (4) 


TOP 


[nt]: can, were (2) 

idiosyncratic vowel change: do, will (2) 

[emt]: am (1) 

[nt] but idiosyncratically deletes [t] in the auxiliary (see Zwicky & Pullum 
1983: 508—509 for discussion): must (1) 


mo BO 


For any of these alternations to be productive, it must have no more than jg = 6 excep- 
tions. The most promising [nt], which applies to 8 auxiliaries and thus has 10 exceptions, 
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is a long way off. Even if we are to include the [nt]-taking have, has, and is and ignore 
the unpredictable variant [emt] form, the rule “nt — nt” still falls short of productivity. 
Thus, the learner will be able to conclude, from the Conditions on Gaps (4), that n’t con- 
traction is not a productive process for English auxiliaries and must be learned lexically. 
If amn't fails to appear in the input, it will be absent. Only after the learner has already 
concluded that a category does not have a productive rule can they start to regard the 
absence of evidence as evidence of absence. 

The preceding analysis, while correctly identifies the n't gaps, has some inadequacies. 
For one thing, based on the 18 contracted forms, the primary evidence for language ac- 
quisition, learners would also identify mustn't and oughtn't as gapped as they fall below 
the minimum frequency of once per million. This is not necessarily a fatal shortcoming: 
mustn't and oughtn't are still considerably more frequent than amn’t and mayn't, the two 
genuinely gapped forms, and children may acquire them in later stages of acquisition. 
But more significantly, as Steve pointed out to me in a personal communication (unre- 
lated to the current celebratory volume), the preceding brute-force approach misses an 
important structural generalization. Table 2 is divided into two halves on Steve's advice. 
As he insightfully observes, none of the auxiliaries that ends in an obstruent is gapped; 
these are listed in the top portion of the Table. By contrast, gaps are only found in the 
auxiliaries that do not end in an obstruent, which are listed in the bottom portion of the 
Table. 

If we carry out a Tolerance analysis along the feature [+obstruent], a much more 
elegant and interesting pattern emerges. For the 12 [+obstruent] auxiliaries, only four 
have exceptions — has, have, is, and must — just below 0j; = 4. Thus, English learners 
can identify a productive rule: 


(6) nt—o nt/[*obstruent] d ` 


This immediately accounts for the fact that speakers generally accept the forms mightn't 
and oughtn't despite their very low frequencies (well below once per million): these 
two auxiliaries, of course, follow the structural description of (6). By contrast, amn t, 
mayn't, daren't, and shan't, some of which appear more frequently than mightn't and 
oughtn't, are generally rejected because they fail to meet the structural descriptions of 
the productive rule in (6). 

Consider now the six [-obstruent] auxiliaries in the bottom portion of Table 2. Here 
am and are have [emt], can and were add [nt], and do and will have idiosyncratic vowel 
changes. Since the Tolerance threshold 0s = 3, no distinct pattern will be identified as 
productive: lexicalization is required and gaps are predicted for mayn’t, daren't, shalln’t, 
and of course amn't. 

The calculation here is very delicate but it is interesting to push the Tolerance Principle 
to the limit. What if the child has not learned ain't as the n't-contracted form for am 
and are? Although ain't forms are quite robustly attested in COCA as well as in child- 
directed English, they are still strongly dialectal and are, at least in the input to some 
children, less frequent than the “regular” forms such as aren't, isn't, haven't, and hasn't. 
If so, a child during an early stage of acquisition may in effect have only five [-obstruent] 
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auxiliaries and their contracted forms to learn from: namely, are, can, do, were, and will. 
Here the statistically dominant pattern of “nt — [nt] / [-obstruent] & — " does reach 
productivity: the two idiosyncratic exceptions of do and will fall below the threshold of 
05 = 3, and n't contraction is predicted to be transiently productive! 

Bill Labov (personal communication) distinctly recalls being a young amn’t speaker 
only to exit that stage at a later time. Indeed, we can find attested examples in Amer- 
ican English-learning children's speech. The three examples in (7) are taken from the 


CHILDES database (MacWhinney 2000): 


(7 a. I amn't a dad. (Kate/Kim, 3;6: Sawyer Corpus 3-12-92.cha) 
b. I'm doing this puzzle well, amn't I? (Mark, 3;11: MacWhinney Corpus 67b1.cha) 
c. Amn'tI clever? (Mark, 3;11: MacWhinney Corpus 67b2.cha) 


The reader is encouraged to listen the audio recordings of the examples in (7) in the 
CHILDES database. The first child's identity is unclear due to discrepancies in transcrip- 
tion. The examples from Mark can be heard as the investigator's exact revoicing (Brian 
MacWhinney, personal communication). Although three examples seem quite rare, it is 
worth noting that almost all am's are contracted onto the pronoun (i.e., I’m not). Of the 
one million American English child utterances, there are only 42 full forms of am fol- 
lowed by negation (i.e., I am not), which makes the three amn’t errors not so negligible. 
Of course, everyone eventually hears ‘I ain’t’: from pop songs on radio if not from the im- 
mediate family and friends. Thus, amn’t will disappear according to the Tolerance-based 
analysis, for ain't introduces an additional exception which leads to the breakdown of 
productivity for the [-obstruent] class. 

Corroborative evidence for the (transient) productivity of n't contraction can also be 
found in other auxiliaries. To my great surprise, there are numerous instances of wilIn't 
as the negative contracted form of will and whyn't for ‘why don't/didn't in the speech 
of many parent-child dyads, apparently all from the New England region. Other than 
enriching the empirical data on contraction, willn’t and whyn’t do not tell us much about 
the productivity of n’t contraction or its acquisition: if parents use them frequently, and 
they do, children will follow. Nevertheless, willn’t can also be found in the spontaneous 
speech of children who are not from the New England region? 


(8) a. No we willn't. (Ross 2;9, Colorado, MacWhinney Corpus 26b2.cha) 
b. Oh it willn't fit in there (Marie 6;6, Ontario, Evans Corpus dyad07.cha) 
c. He willn't be a good boy (Jared 6;7, Ontario, Evans Corpus dyad19.cha) 


Perhaps most strikingly is an utterance produced by Sarah, a child from the Harvard 
studies (Brown 1973): 


(9) And the reindeer saidn't. 


3 Brain MacWhinney (personal communication) confirmed that the only time he or his wife ever used willn’t 
was when transcribing Ross's speech. 

^ The contraction of n't onto the main verb as in (9) was attested in the history of English: see Brainerd 1989 
for caren't (‘don’t care’) and Jespersen 1917 for bettern't, usen't, and indeed whyn't. 
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Taken together, the examples in (7), (8), and (9) suggest that n’t contraction is at least 
transiently productive for some English-learning children. 

Ross’s willn’t presents an especially interesting opportunity for studying the produc- 
tivity of n’t contraction. The CHILDES corpus contains a relatively extensive record 
of Ross’s longitudinal language development. We can then study his auxiliaries and 
contractions, and subject his individual grammar to the kind of fine-grained analysis 
of Adam's past tense (§2). By the time Ross produced No we willn’t, he had used 9 n't- 
contracted auxiliaries: 


(10) a. couldn't, didn't, haven't, isn’t, wouldn't 


b. aren't, can't, don't, won't 


If Ross had not started partitioning the auxiliaries by the [+obstruent] feature, the 
N = 9 examples in (10) supports the productive use of n't contraction because the four 
examples in (10b) are below the number of tolerable exceptions (09 = 4.2). The 5/4 split 
between rule-governed and exceptional items is exactly the stimuli used in the artificial 
language study (Schuler, Yang & Newport 2016) where children nearly categorically gen- 
eralized the rule. If he failed to distinguish the syllabic [n] in (10a) and the nonsyllabic 
[n] in aren't and can't in (10b), it would have been even easier for n't contraction to 
reach productivity. Thus, Ross's productive use of n't contraction in (8) is predicted by 
the Tolerance Principle. 

The naturalistic evidence from child language is admittedly thin, but it suggests that 
the emergence of the amn't and other gaps in the auxiliary system may be due to the 
use of ain't. Again, the gaps would not be the result of mutual exclusivity: there are 
doublets such as haven’t~ain’t etc. so amn't and ain't could have coexisted side by 
side. Gaps arise/arose because the form of ain't weakens the numerical advantage of n't 
contraction, pushing it below the Tolerance threshold. 

Finally, a little historical detective work bolsters our treatment of the amn't gap? Ac- 
cording to Jespersen (1917: 117), “the contracted forms seem to have come into use in 
speech, though not yet in writing, about the year 1600” The change appears to have 
originated in non-standard speech before spreading to mainstream usage. Subsequent 
scholarship, however, places the date to a somewhat later time (e.g., 1630, Brainerd 1989: 
181; see also Warner 1993: 208-209). Pursuing the results from the Tolerance-based anal- 
ysis, we can make two observations. 

First, it is likely that n’t-contraction was at one point productive, which seems espe- 
cially effective for the [+obstruent] auxiliaries; see also (9) and fn 4. Brainerd's study 
finds that didn't, hadn't, shouldn't, and wouldn't appeared from 1670s, soon after the n't 
contraction appeared in the English language. These were followed by couldn't, mightn't, 
needn't, and mustn't in the 18th century, and the last to join the group was oughtn't in 
the 19th century, first attested in Dicken's 1836 The Village Coquette. Thus speakers at 
that time must have formed a productive contraction rule for [+obstruent] auxiliaries, 
perhaps like the one given in (6). Following this line of reasoning, we make the pre- 


5Tam grateful to Anthony Warner for pointing out the important study of Brainerd 1989. 
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diction, admittedly one that is difficult to test, that if a new [+obstruent] auxiliary is to 
appear in the language, it will be immediately eligible for n’t contraction. 

Second, and in contrast to the [+obstruent] class that had been expanding the num- 
ber of n’t contractible auxiliaries, the [-obstruent] class has been steadily losing mem- 
bers. Interestingly, the [-obstruent] auxiliaries were quite systematically available for 
n't contraction by the end of the 17th century (Brainerd 1989). Of special interest are of 
course those that were n’t contracted in the past but are presently gapped. According 
to Brainerd’s study, the first instance of shan’t appeared in 1631, mayn’t in 1674, daren’t 
in 1701: all three are now gapped. The very fact that they fall out of usage points to 
the non-productivity of n’t contraction for these [-obstruent] auxiliaries: in general, a 
productive rule would have gained rather than lost members. 

How, we wonder, did the [-obstruent] class lose its productivity? Much more detailed 
historical investigation will be needed but an interesting hypothesis can be offered as 
follows. The historical development of n’t contraction may mirror the trajectory of lan- 
guage acquisition by children; that is, ontogeny may recapitulate phylogeny. Our dis- 
cussion of children’s n’t contraction in modern American English suggests that the use 
of ain’t for am not, which children probably acquire later during acquisition, increases 
the number of exceptions for the contraction process. It is conceivable that the emer- 
gence of ain’t, an unpredictably contracted form of am not, was also the culprit for the 
breakdown of productivity. 

Historically, an’t/a’nt surfaced as the contracted form of am not between 1673 and 
1690. But by the early 1700s, an’t/a’nt began to be used for both am not and are not (Brain- 
erd 1989: 186). Whatever the phonological cause for this convergence, or how/when ain’t 
joined the fray, the effect is that am not no longer had a predictable form of contraction. 
If our analysis of children’s amn’t and willn’t is correct, then we would find amn’t and 
ain’t to be in complementary distribution: If a dialect does not allow ain’t for am not, 
amn’t would be possible; otherwise amn’t would be gapped. 

The most direct evidence for this suggestion comes from the dialectal distribution of 
amn't, and its correlation with ain't. The OED notes that amn't is present in “nonstan- 
dard” American English and various northern parts of the UK. There is little to sug- 
gest that amn’t is possible in American English at all; all the five occurrences in COCA 
come from Scottish and Irish writers.® It is remarkable, then, that Scotland and Ireland 
have “traditionally completely ain’t-free dialects” (Anderwald 2003b: 520): it is precisely 
in these regions where amn’t is robustly attested, both in the century-old The English 
Dialect Dictionary (Wright 1898) and in recent dialect surveys of English (Anderwald 
20032)." 

Before I conclude this section, it is important to clarify the scope of the present anal- 
ysis. The Tolerance Principle, through Conditions on Gaps (4), can identify defective 
morphological category where gaps may emerge. Such categories are defined by the 


5 The corpus of child-directed American-English, surprisingly, contains one instance of amn't: "I am stirring, 
amn't I?” It was produced by Colin Fraser, on staff in Roger Brown's Harvard study of language acquisition 
(1973). A little Internet research reveals that Fraser, later a Cambridge scholar with a few psychology 
textbooks to his credit, is a native of Aberdeen. 

7 I would like to thank Gary Thoms for discussion for the distribution of amn't in Scottish English. 
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structural descriptions of rules. It does not predict, at least synchronically, which items 
within these categories will be gapped. That issue, in my view, is completely a matter 
of usage frequency: if the inflected form of an item in a defective category is used very 
rarely or not at all, it will be gapped. Of course, it is also possible that no gaps are found 
in a defective morphological category, if all items happen to be inflected sufficiently fre- 
quently. In that case, however, we do predict that if a novel item matches the structural 
description of a defective category, the speaker will be at a loss to produce an inflected 
form. Thus, the emergence of gaps, just as the calibration of productivity, is determined 
by the composition of the input data. Finally, the preliminary work on the history of n’t 
contraction suggests that the Tolerance Principle can be applied to the study of language 
change. It makes concrete predictions about productivity — the rules that could gain new 
members, and the rules that could only lose existing members - as long as the relevant 
values of N and e from historical data can be reliably estimated. The reader is referred 
to Yang 2016 for a case study of the so-called dative sickness in Icelandic morphosyntax. 


4 Gaps in I-language 


Halle's classic paper (1973) contains the much criticized proposal that gaps are caused by 
the [+Lexical Insertion] feature associated with certain forms. As noted earlier, this kind 
of lexical conservatism is difficult to reconcile with the unbounded generativity of word 
formation, and similar approaches using indirect negative evidence are also unlikely to 
succeed. But in a footnote of that very paper, Halle proposes an alternative approach 
which he himself regards as equivalent but has almost never been discussed by other 
researchers: 


The proposal just sketched might be modified somewhat as regards the treatment 
of words formed by rules that traditionally have been called “nonproductive”. One 
might propose that all words formed by non-productive rules are marked by these 
rules as [-Lexical Insertion]. The smaller subset of actually occurring words formed 
by such rules would then be listed in the filter with the feature [+Lexical Insertion]. 
... In other words, it is assumed that words generated by a productive process are 
all actually occurring and that only exceptionally may a word of this type be ruled 
out of the language. On the other hand, words generated by a nonproductive rule 
are assumed not to be occurring except under special circumstances. In this fashion 
we might capture the difference between productive and nonproductive formations 


(5). 


Hetzron (1975), while arguing against Halle's [+Lexical Insertion] proposal, makes 
essentially the same suggestion. Rules are either productive or lexicalized, and gaps 
arise in the unproductive corners of the grammar. His conception of gaps can be strongly 
identified with the Elsewhere Condition, a critical component of the present theory: 


The speaker must use ready-made material only for “exceptional” forms, while ev- 
erywhere else he could very well “invoke the word formation component". Tech- 
nically, this can be represented by a disjunctive set of rules where idiosyncratic or 
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“exceptional” formations are listed with as much explicitness as necessary, while 
the general word formation rules would appear afterward, with the power to apply 
“to the rest” (871). 


That is, gaps arise when productivity fails. The problem of gaps thus reduces to the 
problem of productivity. Some subsequent proposals have adopted a similar approach 
(Albright 2009; Baronian 2005; Hudson 2000; Maiden & O’Neill 2010; Pullum & Wil- 
son 1977; Sims 2006), including Steve’s own account (2010): gaps result from conflicting 
forces in word formation such that the output form becomes unpredictable and thus 
unrealized. The Tolerance Principle provides a precise solution of what makes a rule 
productive, and its application to gaps reinforces the general position that gaps and pro- 
ductivity are two sides of the same coin. 

The Tolerance Principle is a provable consequence of the Elsewhere Condition and 
follows from the general principle of efficient computation: the child prefers faster gram- 
mars, a “third factor” in language design par excellence (Chomsky 2005). In fact, a 
stronger claim can be made in favor of such an analytical approach. I submit that a 
descriptive analysis of languages, however typologically complete or methodologically 
sophisticated, cannot in principle provide the right solution for productivity. First, as 
noted earlier, the categorical nature of children’s morphological acquisition suggests 
that productivity must be demarcated by a discrete threshold (see also Aronoff 1976: 36). 
But note that such a threshold is empirically undiscoverable. Productive processes will 
lie above the threshold and unproductive processes will lie below, but with arbitrary 
“distance” from it in both cases. Thus, the threshold cannot be regressed out of the data. 
Second, while linguists now have an ever expanding arsenal of investigative tools to 
study productivity, ranging from the Wug test to fMRI to Big Data, the psychological 
grammar is developed without supervision in a matter of few years; these new empiri- 
cal methods presently are at best a description of the speaker’s grammatical knowledge 
and not yet learning models that account for how such knowledge is acquired. Finally, 
even if we were to discover the threshold of productivity through a statistical analysis - 
e.g., a productive rule must hold for at least 85% of eligible words — it would still remain 
mysterious why the critical value is exactly what it is, rather than 80% or 90%. 

In other words, an I-language approach to productivity is needed, one which builds 
exclusively on the inherent constraints on language and cognition that all children have 
access to, with deductively established properties that must hold universally across lan- 
guages. The study of language as a part of human biology, I believe, is an approach 
that Steve endorses and pursues (Anderson & Lightfoot 2002), which can be seen in his 
writings on morphology and related issues (Anderson 2010a; 2015). 

Finally, a personal note. It is no exaggeration to say that I owe my professional career 
to Steve. He managed to create a position for me at Yale, which kept me close to my 
young family and thus linguistics, and further away from the seductive fortunes in the 
tech sector. It was also Steve who taught me, more effectively than anyone, the difference 
between linguistic evidence and rhetoric. It has been a privilege to learn from him. To 
figure out how to wake up irregular took over 15 years; the answer, I hope, is to his 
satisfaction. It may once again win me a spot, this time in the Linguistic Club of Ashville, 
North Carolina. 
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Syntax and Morphosyntax 


Chapter 12 


Special clitics and the right periphery in 
Tsotsil 
Judith Aissen 


University of California, Santa Cruz 


This paper documents the distribution of the definite enclitic =e in Tsotsil (Mayan), a clitic 
which occurs on the right periphery of utterances. On the basis of this distribution, it is 
argued (contra some restrictive theories of clitic placement) that =e cannot reach its surface 
position in the syntax, but must be positioned by the phonology. The property of =e which 
determines its placement is its obligatory association with the prosodic peak of the intona- 
tional phrase, a peak which is located at the right edge of that phrase. The relation of =e 
to several other elements which likewise occur at or near the right periphery of the intona- 
tional phrase in Tsotsil is considered, and a possible historical scenario which can account 
for the properties of =e is suggested. 


1 Introduction 


This paper has two goals. The first is to document more fully than has been done pre- 
viously the distribution of the definite enclitic =e in Tsotsil (Mayan), a clitic which is 
restricted to the right periphery of utterances, (§2-§3).’ The second is to suggest that =e 
is a special clitic in the sense of Anderson (2005) (following Zwicky 1977): “a linguistic el- 
ement whose position with respect to the other elements of the phrase or clause follows 
a distinct set of principles, separate from those of the independently motivated syntax 
of free elements in the language” (31-32). The property of =e which makes it “special” is 
the extent to which it may be separated from the phrase in which it is licensed (§4).? In 
the analysis proposed here, this separation results from the requirement that =e function 
as the prosodic peak of the intonational phrase in which it occurs (§5), a requirement 
which can place it at a significant remove from its syntactically-motivated position. The 
requirement of prosodic prominence is unusual for a clitic. Anderson (2005) emphasizes 
the fact that clitics cannot be defined by the absence of “accent”, as a clitic can bear an 


! The distribution of this enclitic is noted in Aissen (1992: 61) but without much supporting data or discussion. 
It is also discussed in Skopeteas (2010) as part of a broader treatment of terminal clitics in Mayan languages. 
? This property is emphasized in Skopeteas (2010). 


Judith Aissen. 2017. Special clitics and the right periphery in Tsotsil. In Claire Bowern, 
| Laurence Horn & Raffaella Zanuttini (eds.), On looking into words (and beyond), 235- 
262. Berlin: Language Science Press. .9281/zenodo. 
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accent if it happens to fall in an accented position within a larger prosodic constituent. 
But cases in which a clitic is required to occupy such a position - and will reorder in 
order to reach it — have not, to my knowledge, been documented. $6 speculates on how 
-e might have come to be associated with the phonological properties that force it to its 
surface position. 

The fact that =e can occur outside the syntactic domain in which it is syntactically 
licensed poses a challenge to theories which hold that clitics reach their surface posi- 
tions through syntactic operations, e.g., Bošković (2000) and Bermüdez-Otero & Payne 
(2011). For them, even clitics which are pronounced in prosodically determined positions 
nonetheless reach those positions in the syntax, with the role of phonology limited to 
filtering the outputs of a possibly overgenerating syntax. $4 suggests that this view is dif- 
ficult to maintain in the case of =e. It thus adds to a body of work which has argued that 
phonology can determine word order, especially in the case of weak elements (Halpern 
1995; Chung 2003; Agbayani & Golston 2010; Agbayani, Golston & Ishii 2015; Bennett, 
Elfner & McCloskey 2015). 


2 The definite enclitic in Tsotsil 


2.1 The basics 


All dialects of Tsotsil have at least one enclitic which is associated with definite determin- 
ers, as well as with several other elements. The dialects differ with respect to how many 
such clitics they have, how many determiners they have, and what other elements the 
clitics associate with. Under discussion here is the dialect of Zinacantec Tsotsil (Z Tsot- 
sil). Z Tsotsil has one such clitic, =e.3 Among other elements, =e is associated with both 
of the definite determiners, li (PROXIMATE) and ti (REMOTE) (this association is indicated 
in examples by an overbar).* 


[ 1 
(1) a. bat lati vinik=e. 
CP-go CL DET man-DEF 


‘The man went (they say). (Laughlin 1977: 28) 


3 Tsotsil is spoken in Chiapas, Mexico by some 400,000 people. Claims made here about Zinacantec Tsotsil 
are based on a large body of text material and work with five native speakers over a number of years. Texts 
include naturally occurring speech, texts originally written in Tsotsil, and texts translated from Spanish 
to Tsotsil (the New Testament, cited as NT). Grammatical examples are almost all taken from texts; un- 
published sources are cited as AUTHOR. Examples cited as ungrammatical have been checked with several 
speakers and their impossibility is consistent with the patterns seen in the text material. 

4 Like other Mayan languages, Tsotsil is verb-initial, usually V(O)S. It is also a head-marking language with 
ergative alignment. Affixes glossed ERG, ABS, GEN express q features of arguments on agreeing nouns and 
verbs. Absolutive 3rd singular has no exponent and is not indicated in examples. Orthographic symbols 
have the expected values except for x = [f], j = [x], ch = [tf], and ' = [?] (except in symbols for ejectives, p; 
t, ts’, ch’, k’). 
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cual 
b. Buy li j-ve’el=e? 
where DET GEN.1-meal-DEF 


"Where is my meal?’ (Laughlin 1977: 57) 


The deictic distinctions made by determiner+enclitic are fairly subtle and both determin- 
ers can be translated by English ‘the’. More salient distinctions are made by incorporat- 
ing deictic adverbs into the pp. As these examples suggest, =e occurs in a “final” position 
and I sometimes refer to it as a TERMINAL CLITIC. This distinguishes it both from second 
position clitics (e.g., the reportative clitic la in (1a)) and from terminal elements which 
are not clitics (e.g., those discussed in 85.2). 


2.11 Licensing 


There is a dependency between the definite determiners and -e: the determiners almost 
always co-occur with =e. Written texts rarely omit it, and speakers judge sentences with- 
out it to be "incomplete". In spoken language, -e is sometimes omitted, perhaps due to 
performance factors, to register, to individual speaker style, or to some other factor. The 
claims made here hold for relatively careful speech and for written texts. Other elements 
which license -e include a set of deictics which function as demonstratives and adverbs, 
as well as certain subordinators. The lexical elements which license -e in Z Tsotsil are 
shown in Table 1. The determiners li and ti figure in many of the temporal adverbs and 


Table 1: =e licensors in Zinacantec Tsotsil 


Category Items 


Definite determiner li (PROX) 
ti (DISTAL) 


Spatial demonstrative/adverb ` li’ ‘(this) here’ 
le’ (that) there’ 
taj ‘(that) over there’ 


Temporal adverb lavi ‘today’ 
Subordinators ti (complementizer) 
ti mi ‘if? 


(ti) kalal ‘when’ 
(ti) yo’ ‘place where’ 


subordinators listed in Table 1: in the third category, lavi ‘today’ is derived from li avi; in 
the fourth, the complementizer ti may be the determiner, serving to nominalize a clause; 
mi is the polar question particle, but always occurs with ti when it introduces the prota- 
sis to a conditional; k'alal ‘when, the time when’ frequently occurs in collocation with 
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ti, as does yo’ (‘place where’). I assume then that =e realizes the feature [+DEF] in this 
dialect? Examples (2a,b) show =e licensed by elements other than determiners: 


(2) a. Och-an ech’elli’ ta ch’en=e! 
enter-IMP DIR here in cave-DEF 


'Enter the cave here! (Laughlin 1977: 71) 


b. Kalal i-k'ot ta s-ch'en-e... 
when cP-arrive P ERG.3-cave-DEF 


"When he arrived at his cave..’ (Laughlin 1977: 72) 


Aside from the qualification noted in fn. 5, elements which are not [+DEF] do not 
license =e in Tsotsil. This includes lexical categories (nouns, verbs, adjectives), related 
semi-functional categories like auxiliaries, and functional categories like the indefinite 
article, prepositions, negation, focus markers, coordinators, etc. Thus, =e does not occur 
in the position marked by the asterisk in any of the following examples as none of them 
contains an appropriate licensor. 


* 


(3) a. S-nup la ta be jun tsebun 
ERG.3-meet CL on path INDF girl PAR 


‘He met a girl on the path’ (Laughlin 1977: 306) 
b. I-k’opoj la tal ta vinajel *. 

CP-speak cr coming P heaven 

‘He spoke on arriving in heaven, (NT: Mark 1,11) 
c. Ta xax-’och kok?’ ok’ob z 


ICP CL AsP-enter fire tomorrow. 


‘The war will start tomorrow: (Laughlin 1977: 119) 


2.1.2 Terminal position: 1st approximation 


Examples (1)-(2) suggest that =e occurs at the right edge of the phrase headed by its 
licensor. We will need to revise this, but it is true that =e in pp’s, for example, must 
follow all post-head material in the phrase, including modifiers (4a,b) and possessors 
(4c). There are no other possible positions for =e in these examples — in particular, it 


=e sometimes occurs without an overt licensor, but still associated with a definite interpretation. Nominal 
cases include 1st and 2nd person pronouns (in certain syntactic positions), proper names (occasionally), 
and headless relatives with definite interpretations (frequently). These are all clearly definite, so associa- 
tion with a [+DEF] head seems unproblematic. Certain semantically dependent clauses can also end in =e 
without an overt licensor being present (e.g., a determiner or subordinator). These usually present back- 
ground (given) information and correspond, for example, to English when or since clauses. Whether =e in 
these cases should be viewed as the realization of a [+DEF] feature or some other related feature is unclear. 
The clausal cases are not directly relevant to present concerns since =e is never separated in these from the 
domain in which it is licensed (the entire clause). Hence I leave them aside. 
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absolutely cannot attach to the head noun nor to the first prosodic word in the phrase 
(these positions are marked with asterisks). 


SERIES 
(4 a. [ti moletik* vone tey ta Ats'am-e]pp 
DET elders ` Jong ago there P Salinas-DEF 


‘the elders of long ago from (there in) Salinas’ (AUTHOR) 


EEGENEN 
b. [ti anima * j-muk’tot=e]pp 
DET late GEN.1-grandfather-DEF 


‘my late grandfather (AUTHOR) 


c. [li j-me’ * [li vo’on=e]pp|pp 
DET GEN.1-mother DET PRO.1SG-DEF 


‘my mother’ (AUTHOR) 


(4a-c) come from texts in which the pp is a topic. These occur “external” to the clause and 
are thus isolated from the effects of other elements which (as we will see below) interact 
with the position of =e. 


2.13 Coalescence 


An important property of =e is COALESCENCE. In (4c), the larger pp contains two licensors, 
each of which should be matched by =e. One (the first li) is the head of the larger pp (the 
possessum), the other (the second li) is the head of the embedded pp (the possessor). The 
right edge of the two pp’s coincide and only a single clitic is possible at this edge. This 
is a general property of terminal clitic systems in Mayan; even when multiply licensed, 
only a single such clitic occurs (within the relevant domain) (Skopeteas 2010). 


2.1.4 Clitic vs. affix 


Though it is generally accepted that “clitic” is a cover term for a diverse set of elements 
and not a formal grammatical category, the term is still used descriptively. To motivate 
the use of the term “clitic” to refer to Tsotsil =e, I survey some of the criteria that have 
been used in the past to distinguish clitics from (ordinary) affixes (Zwicky & Pullum 
1983). All of these align =e more closely with “clitics” than with inflectional affixes. [1] 
it imposes no selectional restrictions on the host, but may attach to members of any 
lexical category that falls in the appropriate right-edge position. In addition to nouns, 
these include verbs, as in (5c), adjectives, particles (see §5.2), and even second position 
clitics like the reportative clitic /a in (5a); [2] there are no arbitrary gaps in the possible 
X=e combinations; [3] the form of the host is not sensitive to the presence of the clitic 
(the clitic triggers no allomorphy and does not participate in lexical phonology); [4] there 
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are no semantic idiosyncracies associated with =e; and [5] =e attaches outside all other 
suffixes, e.g., noun plurals, (5b), and agreement suffixes, (5c). 


PO 
(5) a.a ti vone lase 
TOP DET long.ago CL-DEF 


'as for long ago (they say)' 
leis cl 

b. ti jeneral-etik-e 
DET general-PL-DEF 


'the generals' 


c. li takin ta j-ta-tikotik=e 
DET money ICP ERG.1-find-1PL.EXCL-DEF 


‘the money that we could find’ 


At the same time, =e is prosodically more like an affix than other clitics in the lan- 
guage. Tsotsil has various "simple" clitics, i.e., syntactic words which are prosodically 
weak. Like other words in the language, all of these have an onset, e.g., the interrog- 
ative polarity particle mi, the definite determiners ti, li, negation mu, second position 
modal and aspectual clitics (xa, to, me, la). In contrast though, =e, like many inflectional 
affixes, lacks an onset. Further, except for the second position clitics, the simple clitics 
all precede their complements, while =e follows everything in its phrase. 

If “clitic” is not a formal grammatical category, then the properties of =e must follow 
from its analysis as a word or affix. There are a number of possible analyses that could 
be considered. We could analyze it as a prosodically deficient word which heads its own 
phrase within the pp, as shown in (6). 


(6) DP 
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Here, =e heads a DEFP which is selected by D and which itself takes a NP complement. 
We could account for the phrase- final position of =e by assuming that =e requires that 
its specifier be filled, and that the NP complement raises to its left to satisfy this require- 
ment (this would follow proposals of Cinque 2005 and Simpson 2005, who account for 
the phrase-final position of demonstratives in various languages via leftward movement 
of NP within DP).° Another possibility would be to analyze =e as inflectional morphol- 
ogy which spells out a definiteness feature associated with the noun phrase on the right- 
most terminal of that phrase, much as Miller (1991) analyzes the French deictic clitics 
-ci and -Ià. A third possibility is to analyze =e as a phrasal affix, analogous to the treat- 
ment that Anderson (2005) proposes for the English genitive marker ’s and somewhat 
tentatively for definitive accent in Tongan. In this approach, =e would be introduced 
post-syntactically by the phrasal morphology as spell-out of the feature [+DEF] on DP 
and its surface position would be determined by a constraint operating within an OT 
constraint system. Any of these approaches will have to confront the issues discussed 
in the next section; how well each would fare is not a question I address here. Going for- 
ward, I will assume the analysis sketched in (6), according to which -e is a prosodically 
deficient word which is introduced in the syntax. 

Under any of these analyses, -e is licensed within the phrase headed by its licensor, 
usually pp, and I take this phrase to be the “syntactic domain" of the clitic. The puzzle 
that gives rise to this paper is the fact that =e does not in fact always close the phrase 
in which it is licensed but often occurs considerably further to the right. I argue below 
that this is because =e can occur only at the right edge of an intonational phrase (iP), 
an edge which is often located further to the right than the right edge of the phrase in 
which =e is licensed. The evidence for this is presented in 83; in $5, I consider why =e is 
constrained in this way. 


3 Prosodic constraints on -e 


Although -e frequently appears at the right edge of the phrase in which it is licensed, the 
larger descriptive generalization about its position is not syntactic, but prosodic (Aissen 
1992; Skopeteas 2010): 


(7) =e occurs at the right edge of the iP which contains its licensor. 


Descriptions of Z Tsotsil characterize prosodic prominence at two levels - the word 
and the phrase. At the word level, stress falls on the initial syllable of the root; at the 
phrase level, it falls on the final syllable of the iP (Laughlin 1975,23; Haviland 1981,14) 
(stress being predictable, it is not marked in the orthography). I assume then that the 
final syllable of the iP is its prosodic peak.’ A detailed phonetic study of intonational 


Note that this movement would violate the anti-locality constraints proposed in Pesetsky & Torrego (2001) 
and Abels (2003) which preclude movement of the complement of a head (a phase head in Abels' account) 
to the specifier of that head. 

7 The association of prosodic prominence with the final syllable of the tP is reported for other Tsotsil dialects 
(Cowan 1969: 4; Delgaty & Sanchez 1978: 11) as well as for the sister language Tseltal (Shklovsky 2011; Polian 
2013); see Bennett (2016: §6.1) for an overview of lexical and phrasal stress in Mayan. 
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phrasing in Tsotsil does not yet exist, but some preliminary observations are possible. 
The final syllable is associated with characteristic boundary tones. The most common 
pattern involves a rise in pitch on the vowel of the final syllable, with the larger context 
determining whether that rise is sustained throughout the syllable or followed by a fall 
(relevant factors include whether the (D is final in the utterance or not (as in the case 
of topics, for example, §3.2)). The final syllable of the iP is sometimes followed by a 
significant pause and when it is, the vowel of that syllable is often lengthened. 

Some of these properties are evident in Figure 1, taken from a naturally-produced 
narrative by a Z Tsotsil speaker; this example occurs utterance-finally and shows a final 


fall. 


180 
N 
EJ 
= 1104 
g 
= 
li ta lot(i) ko ti(k) ta a nil 
40 
71 161 146 131 155 168 114 344 
Figure 1: Pitch track and waveform for (8). 
(8) L-i-tal-otkotik ta anil. 


CP-ABS.1-come-1PL.EXCL in hurry 
"We came in a hurry. (AUTHOR) 
A key observation is that because =e aligns with the right edge of the iP, then, what- 
ever else it is, it is the final syllable of the iP. It thus carries the boundary tone, and is 


often followed by significant pause and lengthened. This is illustrated in Figure 2, which 
is based on (9), from the same narrative as Figure 1; this phrase is also utterance-final. 


at pee | 
(9) ..te  tas-na li Maryan Papyan=e. 


there P. GEN.3-house DET Mariano Papyan-DEF 


“there in the house of Mariano Papyan’? (AUTHOR) 


The analysis proposed in $5 hinges on the obligatory association between -e and the 
prosodic peak of iP. 
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180 


my i | | ! b 
ki IJ T ER li 2l 


1104 


Pitch (Hz) 


te ta sna li ma r(y)an pa pya ne 


40 
76 119 189 87 170 148 155 196 294 


Figure 2: Pitch track and waveform for (9). 


As in other languages, utterances consisting of a simple clause are parsed as a single 
(D. There are also two structures in Z Tsotsil which are associated with obligatory (P 
breaks, resulting in utterances with multiple ıP’s, and therefore multiple positions for 
=e under (7): an external topic is parsed as an tP separate from that of the following 
comment clause and an extraposed cP is parsed as a iP separate from that the preceding 
matrix Clause H Other complements, as well as relative clauses, are usually not extraposed 
and they are prosodically integrated into the iP of the matrix clause. In this section we 
provide support for (7), starting with simple clauses (83.1), then considering structures 
with multiple |P's (§3.2-§3.3), and finally syntactically complex structures which map to 
a single iP (83.4). 83.5 suggests an algorithm for mapping syntactic structure to prosodic 
structure. 


3.1 Simple clauses 


In utterances consisting of a single clause, regardless of where =e is licensed, it appears 
at the right edge of the iP corresponding to the clause. When the licensing phrase itself 
is clause-final, as in (1)-(2), that phrase has the appearance of being closed by -e. But if a 
clause contains several phrases which are headed by licensors, no phrase which occurs 
medially can end in =e. Adding it in in the positions of the asterisks in (10) and (11) is 
impossible. 


8 Some adverbial clauses are obligatorily parsed as separate iP's and some only optionally. These are not 
discussed here, but see Aissen (1992: 59). 
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D ee 1 
(10) S-jipan la ta=ora [ti okil *][ti tul] unse. 
ERG.3-tie CL right.away DET coyote DET rabbit PAR-DEF 
"The rabbit tied Coyote up right away: (Laughlin 1977: 160) 


(11) Ls-ta la tal [li aniyo *] ta yut=vo’ [li choy] un=e. 
CP-ERG.3-find CL DIR DET ring P inside.water DET fish PAR-DEF 


‘The fish found the ring in the water’ (Laughlin 1977: 354) 


One might think that =e is simply omitted when the licensing phrase does not occur 
clause-finally. But examples like (12)-(14) show otherwise. Here (and generally), the 
clause-medial pp does license =e but the clitic is delayed to the end of the clause. 


a A] 
(12)  L-i-'abtej-otikotik scht uk [li Kumpa Lolļap ta museo-e. 
CP-ABS.1-work-1PL.EXCL with DET Compadre Lol P museum-DEF 


We worked with Compadre Lol at the museum. (Laughlin 1980: 25) 


ae ne peril 
(13) Ch-och xa [li k'ok']ay [ok'ob]aay [ta Nibak]pp=e. 


ICP-enter CL DET fire tomorrow P Ixtapa-DEF 


‘The war will begin tomorrow in Ixtapa’ (Laughlin 1977: 119) 


(14) Ta=x-[y]-ak’-ik [ti kantela]ay [noxtok];a,-e. 
ICP-ERG.3-give-PL DET candle ^ too-DEr. 


‘They too were offering the candles? (AUTHOR) 


There are two properties to note in these examples. First, =e must be licensed by the 
determiner since there is no other licensor present; and second, the intervening pp es and 
adverbs are not part of the pp headed by the licensor. In (12)-(14), they modify the entire 
sentence (or the predicate), not the head noun. In (14), the adverb noxtok ‘too, also’ 
is associated with additive focus on the subject ‘they’ (= shamans in the town under 
discussion) not the object (‘the candles’) - the preceding discourse describes shamans 
from a neighboring town offering candles; the current utterance asserts that the ones 
in this town too were offering candles. =e attaches then outside its syntactic domain, 
assuming that domain to be the pp headed by its licensor. 

Going back to (10)-(11), the right conclusion, I think, is that both determiners require 
=e, but that that requirement is satisfied by the single, clause-final enclitic (see also 
Skopeteas 2010). These cases too then involve coalescence, but in a configuration dif- 
ferent from the one illustrated by (4c). In (4c), the right edges of the two pp’s which 
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license =e coincide, but here they do not. (10)-(11) actually provide another kind of evi- 
dence that =e does not always close its syntactic domain: the particle un which occurs 
in both examples (and in many subsequent ones) is not part of the preceding pp, yet 
whenever it occurs, it separates =e from its licensing phrase, (see §5.2 on un). 

Examples like (15) and (16) provide further evidence that =e can occur outside its syn- 
tactic domain: they show that when the phrase that licenses =e is preposed, =e still 
surfaces in post-verbal position, at the right edge of the clause. (15) is from a narrative in 
which a mother gives advice to her son, (16) from one about the Kennedy assassination. 


(15) [Ta sba me l-av-ajnil],, ^ ch-a-muy=e, 
on top CL DET-GEN.2-wife ICP-ABs.2-climb-DEF 
‘It’s on top of your wife that you should climb [not onto the rafters]? (Laughlin 
1977: 56) 


(16) Ja’ nox [li viniketik]g, i-laj-ik ta bala=e. 
Foc only DET men cP-end-PL P bullet-bEF 
‘[The women weren't hit by the bullets], it was only the men that were wounded 
by bullets’ (Laughlin 1980: 15) 


In (15), a PP has been fronted into focus position, as sketched in (17) (the larger con- 
text makes clear that we are dealing with contrastive focus in both (15-16)). Note that a 
fronted focus does not occasion an iP break (Aissen 1992). 


(17) = structure of (15) 
IP 


The licensor for =e in (17) is the head of the circled pp, which is embedded quite deeply 
within the fronted pp, but the enclitic does not close that pp. Instead it surfaces clause- 
finally. (16) is a cleft construction where the focus occurs preverbally. Again =e is licensed 


245 


Judith Aissen 


by the head of that pp but occurs clause-finally (the verb phrase which follows the focus 
does not modify the focus and is presumably not embedded in it). 

With respect then to simple, monoclausal structures, examples (12)-(16) show (in vari- 
ous ways) that in Z Tsotsil, =e does not in general close the phrase headed by its licensor. 
A closer approximation is that it closes the clause containing the licensor (though we 
will see shortly that this is not the whole story either). This holds whether the licen- 
sor is a determiner or some other element, e.g., a deictic adverb. (18)-(19) show that 
an =e licensed by a deictic adverb also occurs clause-finally, again separated from the 
phrase containing the licensor by intervening material (in (19), the adverb functions as 
the clausal predicate). 


[e] 
(18) J-tsak-tik [lavi] [ta k'in]-e. 
ERG.1-grab-1PL.1nc today P fiesta 
"Let's arrest him today at the festival" (NT: Matthew 26:5) 


(19) Muk’ l? s-malal=e. 
NEG here GEN-husband-DEF 


‘Their husbands weren't around here [they had gone to the lowlands]? (Laughlin 
1977: 101) 


3.2 Topics 


As in many other languages, external topics in Tsotsil are parsed as separate iP's (by 
"external topic", I mean one which is attached outside the sentence, often entering into 
an anaphoric relation with a pronoun inside the sentence) (Aissen 1992). Topics are 
usually definite in Tsotsil and therefore are almost always closed by =e (the iP break is 


indicated by ^|"): 

(20) Ti moletik vone tey ta Ats'am-e, |i-s-tsob la s-ba-ik 
DET elders Jong ago there P Salinas-DEF cP-ERG.3-gather CL GEN.3-RR-PL 
ta snuts-el li  biyaetik-e. 


P chase-NOMZL DET Villistas-DEF 


"Ihe elders of long ago (from) there in Salinas gathered to chase the Villistas’ 
(AUTHOR) 


(21 Ti anima j-muk’tot=e |x-ok' xala sutel tal. 
DET late | GEN l-grandfather-bEF AsP-cry CL CL returning here 


‘My late grandfather returned crying’ (AUTHOR) 
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3.3 Complex clauses with CP complements 


CP complements obligatorily extrapose in Tsotsil. While normal order in transitive 
clauses is vos, when o is a cP complement, it occurs utterance-finally (Aissen 1992). 


Ir A] 
(22) I-y-il ti s-me un=e | 


CP-ERG.3-see DET GEN.3-mother PAR-DEF ~N 
ti  muk’=buta s-sa’ y-ajnil ti s-krem un=e 
COMP NEG ICP ERG.3-seek GEN.3-wife DET GEN.3-son PAR-DEF 


‘His mother saw that her son was never going to find a wife’ (Laughlin 1977: 55) 


Extraposition is associated with an obligatory iP break and, as expected, the matrix and 
cp complements form separate domains for clitic placement: the =e licensed by the first 
determiner closes the first iP and the one licensed by the second closes the second iP. 
Extraposition of cP complements also occurs in ditransitive clauses. While the THEME 
precedes the Goat when both are nominal, the THEME follows the coAr when it is a CP: 


[o cy Cael | l 


(23) Ikalbeli kumpa Lol un-e In  ywun chicham  xaun-e. 
Ltold DET compadre Bob PAR-DEF comp because I.was.dying CL PAR-DEF 


‘I told Compadre Bob that I was feeling awful’ (Laughlin 1980: 30) 


Again, extraposition forces an iP break between the matrix clause and its extraposed 
complement. And as above, the two clauses form separate domains for clitic placement. 


3.4 Prosodically integrated subordinate clauses 


While cp complements extrapose, there are other embedded clauses which do not and 
thus remain in their base position. These include 1P complements (selected by verbs of 
perception and some other higher predicates) as well as relative clauses. Prosodically 
these do not form separate iP's, but are integrated into the iP of the matrix clause (see 
An 2007 on languages in which restrictive relatives do not form separate (D sl, 

Consider the 1P complement in (24). It remains in its internal position and is followed 
by the matrix subject: 


(24) Mi ja'uko-bu y-a'i [lok' ti  y-ajnil “Jip ti vinik un=e. 

NEG even ever ERG.3-feel leave DEF GEN.3-wife DET man PAR-DEF 

‘The man didn’t even feel his wife slipping out’ (Laughlin 1977: 49) 
There is no extraposition here and the entire utterance is pronounced as a single iP. If 
=e closed the (smallest) clause in which was licensed, we would expect one to surface 


in the position of the asterisk. But =e is not possible there. Instead, it appears that the 
enclitic licensed within the complement is delayed until the end of the entire utterance, 
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where it coalesces with the one licensed by the subject. Consistent with (7), the enclitic 
licensed within the complement clause is pronounced at the right edge of the iP which 
contains its licensor. 

Relative clauses (nc) also generally do not extrapose. Relative clauses with external 
heads do not occur utterance-internally (if necessary, the sentence is restructured so 
that they occur utterance-finally or sentence-initially as part of the topic), but headless 
relatives (or better, “light-headed” relatives involving a determiner + cp) can? In (25) 
and (26), the nc is sandwiched between the matrix verb and the matrix subject. 


(25) Y-il-oj [ti [bu kot ti "Kal "locln vinik un=e. 
ERG.3-see-PRF DET where arrive DET Spook DET man PAR-DEF 


"Ihe man saw (the place) where the Spook landed’? (Laughlin 1977: 63) 


(26) Ly-ai la [taj [k'alal ch-lok' tal taj 
CP-ERG.3-feel CL DET when 1cr-leave DIR DET 


chon  *]ac]taj ants ` un-e, 
serpent DET woman PAR-DEF 


‘That woman felt (the moment) when that snake left? (Laughlin 1977: 371) 


Like 1P complements, nc's do not constitute separate tP’s, but are parsed together with 
the matrix clause. Examples (25)-(26) show that an =e licensed in such a relative clause 
is realized not at the edge of the relative clause (marked here by an asterisk), but again 
at the right edge of the entire utterance where it coalesces with the clitic licensed by the 
matrix subject. 


3.5 Summary 


The position in which -e is pronounced in Z Tsotsil does not coincide with the edge of 
the phrase in which is licensed, nor even with the edge of the (minimal) clause in which 
it is licensed. Rather, it coincides with the right edge of iP containing its licensor. 

While it is not necessary for our purposes to provide an algorithm for mapping syn- 
tactic structure to prosodic structure (what is important is that iP breaks fall in certain 
positions, not why they fall there), there is a simple principle which determines this map- 
ping if we assume that external topics and extraposed clauses are both adjoined at the 
root of the sentence (Aissen 1992). Assuming that an element X which adjoins to Y is not 
dominated by Y, then neither topics nor extraposed clauses are dominated by any node. 
Hence, like simple clauses, the nodes which define these constituents are “undominated”. 
In this respect they are like root nodes and, following Frank, Hagstrom & Vijay-Shanker 


? I take the nc to be a cP since it contains a fronted wr expression in (25) and a complementizer in (26). 
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(2002), I will refer to them as such. With this understanding, the mapping from syn- 
tax to iP can be characterized as a match between a certain syntactic constituent (one 
dominated by a root node) and a corresponding prosodic constituent (an iP) (on MATCH 
constraints, see Selkirk 2009 and Elfner 2012; on the relevance of the root to defining iP, 
see Downing 1970; Nespor & Vogel 1986; Selkirk 2009). The formulation in (27) is based 
on Bennett, Elfner & McCloskey (2015) and Elfner (2012), 


(27) MarcH Root 
If a root node R in a syntactic representation S dominates all and only the set of 
terminal elements {a,),c, ..., n}, then there must be in the phonological represen- 
tation P corresponding to S an iP which dominates all and only the phonological 
exponents of a,b,c, ..., n. 


When Marcu Root is satisfied in Tsotsil (as it appears always to be), simple clauses and 
some complex structures are parsed as single iP's; extraposed CP's and topics are parsed 
into their own iP's. 

In 85, I develop an account of clitic placement in Tsotsil in which the syntax positions 
=e at the edge ofthe phrase in which it is licensed, per (6), and the phonology accounts for 
its subsequent dislocation to the right edge of iP. This attributes a more significant role 
to the phonology than some theories of clitic placement permit. Hence before turning 
to the phonological account, I consider the prospects for accounts in which phonology 
plays at most a filtering role in the placement of =e in Tsotsil. 


4 A syntactic account? 


While recognizing that the positioning of some clitics is sensitive to prosodic constit- 
uency, some recent theories of clitic placement propose that the role of phonology is 
limited to filtering outputs from the syntax. Consider, for example, Bošković (2000)'s 
account of second-position clitics in Serbo-Croatian. Bošković argues that these clitics 
attach to the first prosodic word within an iP. This is a prosodic generalization, but in 
his account, the prosody does not directly determine the position of second-position 
clitics. Rather, clitics reach their surface positions through syntactic mechanisms. Since 
syntactic mechanisms sometimes place clitics in other than "second" position, PF filters 
out derivations in which the clitics do not suffix to the initial prosodic constituent in the 
iP. Bermüdez-Otero & Payne (2011) propose that all cases of prosodic conditioning of 
clitic placement can be handled in the same way, i.e., clitics are positioned by a possibly 
over-generating syntax, with ill-formed configurations filtered out at PF. 

The problem posed by -e is clear. If its syntactic domain is the phrase headed by its 
licensor (typically, pp), then the syntax should place =e somewhere within that domain. 
However, we have seen that =e can occur outside the phrase in which it is licensed, 
indeed outside the clause in which it is licensed. In fact, it must occur outside that phrase 
(or clause) when it is not iP-final. The only option for an account of clitic placement 
in which phonology does no more than filter outputs from the syntax is to extend the 
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syntactic domain of =e (or the [+DEF] feature which it realizes) beyond the phrase in 
which it is licensed. 

Conceived syntactically, the dependency between the position in which =e is licensed 
and the position in which it is pronounced can span a significant amount of syntactic 
structure — it crosses clause-boundaries including ones which define relative clauses. 
There are various ways that apparent long-distance dependencies are handled, depend- 
ing both on the nature of the dependency and on the particular syntactic model - long- 
distance movement (Transformational Grammar), a sequence of local movements (Min- 
imalism), feature percolation (GPSG/HPSG) and others (Alexiadou, Kiss & Müller 2012). 
It is beyond the scope of this article to develop a syntactic analysis of =e placement, but 
we can point out two properties of the phenomenon that any such analysis must account 
for. One is that the top of the dependency is limited to root (undominated) nodes: =e can 
spell out only at the right edge of an undominated node, and not at the right edge of any 
other node. If movement or percolation are involved, they must therefore be to the root, 
whether that node corresponds to a simple clause, a topic, or an extraposed complement. 
The other is that the bottom of the dependency can be located anywhere within the struc- 
ture dominated by the root. In particular, it can be located within a constituent which 
is otherwise an island for extraction, for example within a pp, as in (15/17) (see Aissen 
1996 for evidence that pp’s are islands for extraction), or a relative clause, (25)-(26) (see 
Aissen 1992). 

It is instructive to consider a particular analysis which would position -e in its low, 
syntactically-licensed position and account for its appearance at the right edge of iP's 
through late, prosodically-conditioned linearization. Bermüdez-Otero & Payne (2011) 
mention this as a possible analysis for cases in which a clitic attaches to a prosodically 
defined domain, like the second position clitics in Chamorro (Chung 2003). They point to 
Linear Syntax (Kathol 2004), a theory of linearization embedded in HPSG, as a possible 
framework for implementation. Linear Syntax imposes precedence relations on sisters 
but, in order to handle discontinuities, permits those relations to be "passed up" the tree 
and then “shuffled” with relations among higher elements. In this way, elements from 
an embedded domain may be separated from one another by elements that belong to 
higher syntactic domains. In the case at hand, =e, linearized, for example at the right 
edge of the phrase in which it is licensed, could be separated from that phrase at higher 
levels, extending its syntactic domain to a higher constituent. 

The question for this account is just what constraints it imposes on the upward “per- 
colation" of precedence relations. In a language which does not in general permit scram- 
bling, which nodes pass precedence relations upwards and which do not? The most 
obvious challenge is posed by the fact that an =e licensed somewhere within a relative 
clause or a PP cannot surface within those phrases if they are not utterance-final, but 
must surface in the matrix. In the shuffling account of examples like (15/17) and (25)-(26), 
the precedence relation between -e and the rest of the licensing phrase (its specifier, 
under (6)) would be obligatorily passed up through the relative clause or pp and then 
shuffled with precedence relations among elements in the matrix clause. Since pp’s and 
relative clauses are otherwise impermeable in Tsotsil, one must wonder why Shuffling, 
but not other syntactic operations, can access elements within them. 
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On the other hand, it is a prosodic fact, independent of anything about =e, that pp’s and 
relative clauses in Tsotsil do not form separate iP's. Hence the fact that an =e licensed 
within them surfaces outside them when they are not utterance-final follows from the 
prosodic generalization in (7). In short, if the relation between =e and the phrase in which 
it is licensed is conceived as a syntactic dependency, its properties are unexpected. But 
if the relation is instead phonological and holds within an iP at a point when syntactic 
structure is no longer relevant, the distribution of -e and its relation to the licensing 
phrase begin to make sense. 


5 A prosodic account 


5.1 Association with prosodic prominence 


I outline here an account of -e in Z Tsotsil. This account shares with Anderson's 2005 
approach to clitic placement the assumption that the surface position of =e is determined 
post-syntactically through an optimization that evaluates alternative positions of the 
clitic against a set of ranked constraints (Prince & Smolensky 1993/2004). It differs from 
Anderson in that -e is not itself subject to a constraint which aligns it with the edge 
of a constituent. Rather the position of =e is motivated by an inherent lexical property, 
namely its association with the prosodic prominence that characterizes the right edge 
of iP's in the language. In this, I closely follow Henderson (2012)’s account of certain 
"status" suffixes in Kiche (also Mayan)? which surface only at the right edge of iP. 
These suffixes attach only to verbs and surface only when the verb occurs tP-finally, (28a). 
Otherwise, the suffix is suppressed, (28b) (accent marks here represent the prosodic peak 
of the utterance): 


(28) a. X-in-tij-ó. 
CP-ERG.1sG-eat-ss 
‘Tate it. 
b. X-in-tij le sub’ 
CP-ERG.1SG-eat DET tamalito 


‘I ate the tamalito’ (Henderson 2012: 775-776) 


Henderson notes that status suffixes are simply omitted from phrase-medial verbs, rather 
than being displaced to tP-final position (see 28b) and attributes this to the fact that the 
suffix is an affix (not a clitic) and attaches only to verbs. He raises the issue of what would 
happen if the element in question were a clitic. The distribution of Tsotsil =e instantiates 
exactly this case: =e is not tied to any particular word class and thus faithful realization 
carries it away from the position in which it is licensed. 

The lexical entry for =e is shown in (29), where the asterisk indicates association with 
the prosodic peak of iP: 


10 These suffixes mark the transitivity status of the predicate and make other distinctions related to mood 
and dependency. 
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(29) x 


€ 


I also adopt Henderson’s constraint set, it being as well-suited to Tsotsil =e as it is to 
the K'iche' status suffixes. The constraints fall into three groups. The first two concern 
the location of prosodic prominence in the iP and are independent of the distribution 
of =e. An alignment constraint (McCarthy & Prince 1993) locates the peak of prosodic 
prominence at the right edge of the iP, (30). CULMINATIVITY (31) limits such peaks to one 
per iP (Hayes 1995). 


(30) Autoen: A peak of prominence lies at the right edge of the iP. 
(31) CULM(INATIVITY): Every prosodic domain has exactly one peak of prominence. 


The second two are faithfulness constraints on the morphology-to-phonology correspon- 
dence (Prince & Smolensky 1993/2004; McCarthy & Prince 1995). REALIZEMorRPH (32), 
a general constraint, calls for faithful parsing of morphemes in the phonology (Kurisu 
2001). IDENTPROM (33) is the key constraint here: it requires that the lexical association 
of =e with prosodic prominence be preserved in the output (Henderson 2012).!! 


(32) REALIZEM(oRPH): Every morpheme in the input has a phonological exponent in 
the output. 


(33) IDENTPROM: if morpheme M has prominence P in the input, then M’, the phono- 
logical correspondent of M, has prominence P in the output. 


Tableau (34) shows the effect of these constraints on the evaluation of an input, that of 
(12), in which the syntactically determined position for =e does not correspond to the 
right edge of an iP. The input in Tableau (34) is a morphophonological representation 
in which syntactic terminals have been spelled-out and in which the hierarchical struc- 
ture of syntax has been replaced by precedence relations and prosodic structure. =e is 
a morphophonological element. Its position is syntactically determined per (6) and its 
association with the prosodic peak is indicated in the input by the asterisk, a morpholog- 
ical diacritic. Candidates for the output are fully linearized phonological representations, 
parsed into prosodic constituents. Prosodic prominence in the iP is marked by an acute 
accent. 

The optimal candidate is [b], which violates none of the constraints shown. However, 
it does violate one which is not shown, LINEARITY, which penalizes outputs which di- 
verge from the precedence relations of the input (McCarthy & Prince 1995).? LINEARITY 
must be lower ranked than any of the four constraints shown in Tableau (34). 


H T have slightly reworded IpEN'TPROM from Henderson to emphasize the distinction between M in the input 
and its correspondent M' in the output. 

12 The high-ranked constraint Marcu Root (27) prevents =e from moving “too far", by requiring that it be 
realized within the same iP as its licensor. 
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(34) Tableau for (12) 


[.li Kumpa  Lol=e* ta museo], IDENT REALIZE 
ALIGN, CULM 

. DET compadre L-DEF P museum Prom MorpH 

a. [.li kumpa lol=é ta museo], x! 


b. «& [...li kumpa lol ta museo-é], 


d. [.li kumpa lol-é ta museó], x! 


l 
l 

c. [....i kumpa lol=e ta mused], poxl 
l 


e.  [...i kumpa lol ta museo], x! 


(35) Lın(EARITY): The precedence structure of the input is consistent with that of the 
output and vice versa. 


When the input has two enclitics, they coalesce in the output. 


(36) S-jipan la ta=ora [ti okil [ti tul] unze 
ERG.3-tie CL right.away DET coyote DET rabbit PAR-DEF 
‘The rabbit tied Coyote up right away: (Laughlin 1977: 160) 


Taking the input to (36) to be [...ti ok'ilze* ti t'ulze"], we can see that the optimal output, 
[b] in (38), violates none of the four constraints (30)-(33): the prosodic peak is aligned 
with the right edge of the iP, there is only a single prosodic peak, the prosodic promi- 
nence associated with =e in the input is preserved in the output, and every morpheme 
in the input has a phonological exponent in the output. The association of input mor- 
phemes to phonological exponents, however, is many-to-one, as indicated by the sub- 
scripts on =e in input and output. Hence the optimal candidate, [b] (236), violates the 
Anti-Coalescence constraint, UNIFORMITY (McCarthy & Prince 1995), as well as LINEAR- 
ITY. Like LINEARITY, UNIFORMITY is ranked below the other constraints shown. 


(37) Uuir(onurry): No element in the output has multiple correspondents in the input. 


(38) Tableau for (36) 


[.ti okile'" ti tul=e*2], ' “IDENT ` REALIZE 


. ALIGN, CULM Lin | UNIF 
...DET coyote-DEF DET rabbit-DEF Prom MORPH 


a. [...tiok’il=é; ti t'ul-é5], * EN" 
mti ok'il ti 'ul-é; 2]. x " 
„ti ok'il ti Pul=é2], * 
. ti ok'ilze; ti tul=é2), l poo l l 
-ti ok'il=é;2 ti Cul), x l l ] x 
..ti ok'il ti tul], ] ! l "m I 
„ti ok'il ti tul=é1=é2]. E * 


a] S) |) 


Candidates not shown include variations on [g] in which one =e or the other does not 
realize the prosodic prominence of the IP, i.e., [...ti t'ul2e;-é?] and [...ti t’ul-é;=e2]. Both 
violate IDENTPROM and the second one violates ALIGN, as well. 
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Some additional facts, not yet presented, show that REALIZEMorpPH must be indexed 
for particular morphemes and that the one which indexes =e is ranked below ALIGN,, 
CULMINATIVITY and IDENTPRom. The definite enclitic =e is not the only morpheme in 
Zinacantec Tsotsil which is lexically associated with the prosodic peak of iP. The other 
is an epistemic particle, a'a, which Laughlin (1975) classifies as an “exclamation” and 
translates indeed!, surely! certainly! of course! a’a does not require licensing, though 
statistically, it tends to occur in utterances with 1st and/or 2nd person arguments and is 
likely cognate with a reduplicated form of the terminal clitic a’ ‘PROXIMATE’ in Yucatec. 
Relevant here is that a’a occurs in the same position as =e, i.e., at the right edge of P, 
with its second syllable functioning as the prosodic peak of iP. 


(39) a. Ta'ajebal li j-ve'el-tik a’a. 
almost.cooked DET GEN.1-meal-1PL.INC EXCLAM 
‘Our meal certainly is about cooked’ (Laughlin 1977: 285) 
b. Ta j-ti’ lavi a’a. 
ICP ERG.1-eat today EXCLAM 
‘Of course I'll eat it today’ (Laughlin 1977: 283) 
c. Ik’-o lè ava! 
take-IMP DEM EXCLAM 
‘Take her!’ (Laughlin 1977: 126) 


d. A li Pineda=e mas mas ts'akal EN 
TOP DET Pineda-DEF more more afterwards EXCLAM 


‘Pineda was later, of course.” (Laughlin 1977: 116) 


=e and a’a compete with one another, with priority given to realization of a’a. Thus =e 
must be omitted when a’a occurs. (39a-c) contain various elements (underlined) that 
otherwise require =e (see Table 1). Here though, a’a entirely precludes realization of =e. 

As an epistemic operator, I assume that a’a occupies a position in the syntax; its exact 
location cannot be determined since it is pronounced only at the right edge of iP. Assum- 
ing that e" and oo" can both be present in the input, one or the other must “disappear”. 
Which is preserved is determined by the ranking of morpheme-specific REALIZEM con- 
straints. In Zinacantec Tsotsil, REALIzE(a'a*) > REALIZE(=e"). The overall ranking of the 
constraints under discussion then is shown in Figure 3. 


5.2 Notes on the right periphery 


I close this section by discussing the relation between the terminal elements =e and a’a, 
and two other elements which “pile up" at the right periphery. The ordering of the four 
is shown in (40): 


(40) 
un -e/a'a che'e 


PAR | DEF/EXCLAM | ‘then’ 
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ALIGN, CULMINATIVITY IDENTPROM REALIZE(a’a") 


ae et 


REALIZE(=e") 


ye 


LINEARITY UNIFORMITY 


Figure 3: Constraint ranking 


The particle un occurs in many of the examples cited above. No meaning (propositional 
or otherwise) has yet been identified for it. Some speakers have the intuition that it 
contributes some nuance of meaning to the sentence; others say that the sentence just 
“sounds better” with it. un has a distribution similar to that of =e and a’a: like them, un 
occurs at the right periphery of root sentences and of topics, and it can separate a matrix 
clause from its extraposed complement. Also like them, it occurs in no other positions. 
Unlike =e, it is not lexically licensed. 

Aissen (1992) analyzed un as an enclitic which aligns with the right edge of ıP. While 
it is true that un always occurs very near the right edge of iP, it does not occur right- 
most when any of the other elements in (40) is present. While it is not yet clear what 
is responsible for its appearance and position, I assume that it is not lexically associated 
with the prosodic peak in iP and that its position is therefore not determined by IDENT- 
Prom. For one thing, as observed in Skopeteas (2010), it does not coalesce with =e (nor 
with a’a). One possibility is that it is present already at Spell-Out at the right edge of iP. 
It would then be present in the input to evaluations like those in (34) and (38), and the 
constraint ranking in Figure 3 would position e and a’a to its right. Another possibility 
is that un is introduced by the phonology for eurhythmic reasons, e.g., to improve the 
prosodic structure of the utterance, perhaps at lower levels of the prosodic hierarchy. I 
leave further development of these ideas for a later time. 

The other element in (40) is che'e, which occurs only in the absolute final position. 
che'e is a discourse particle which Laughlin (1975) translates as ‘then’ (roughly Spanish 


pues): 
(41) L-i-bat xali vo'onze che'e, 
CP-ABS.1-go CL DET PRO.1SG-DEF then 
‘Me, I went, then? (Laughlin 1977: 131) 
che'e can co-occur with =e and when it does, the high boundary tone appears to be 
realized on the last syllable of che'e, not on =e. It seems then to be a counterexample to 


the descriptive generalization that =e is always the prosodic peak of the iP in which it 
occurs. 
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A plausible scenario is that che’e is incorporated into the iP which ends in =e after the 
point at which the constraints discussed above have had an effect. Fleshing this out a 
little, che’e might be syntactically adjoined to the root and mapped into its own iP (like 
topics and extraposed clauses). This iP being however subminimal (two syllables, one 
word), che'e is incorporated into the preceding (D (on the tendency to avoid short tP’s 
or sequences of iP's of different length, see Nespor & Vogel 1986 and Dehé 2009). The 
result here is to push =e back from the edge, and for the boundary tone to fall on the final 
syllable of che’e. An account along these lines assumes that the constraints in Figure 3 
apply within the domain of tP’s that result from the initial prosodic parsing and do not 
reapply at a later stage when prosodic restructuring of multiple iP's occurs. If they did, 
=e would be reordered again, to the right of che'e. How such an account with its implied 
serial optimization fits into the larger theory of the syntax-phonology interface remains 
to be seen. 


6 An historical scenario 


Definite markers which close the phrase in which they are licensed are not uncommon 
(Dryer 2013). It is plausible then that the definite enclitic =e in Z Tsotsil might, at an 
earlier stage, have been the final element in the noun phrase, a position in which it would 
not necessarily have functioned as the prosodic peak of iP. Here I offer a suggestion for 
how =e might have come to be associated with that peak, an association which now 
sometimes forces it out of its licensing phrase. 

The basic idea is simple: the syntax usually determines an utterance-final position for 
the phrase which licenses =e. Hence even without intervention from the phonology, =e 
would have found itself in most cases at the right edge of the utterance. As such, it would 
become statistically associated with the prosodic peak of the iP and this could have been 
reanalyzed as a lexical property. 

There are several reasons why the syntax usually puts the phrase which licenses =e 
in utterance-final position. A number of them come down to the fact that certain gram- 
matical relations in Tsotsil are almost always instantiated by definite noun phrases and 
the syntax determines a position for these relations at the right edge of the utterance 
anyway. These include especially subjects, possessors, and topics. The usual ordering of 
these elements is shown in (42). Starting with topics, as we have already seen, the topic 
precedes its associated clause and always constitutes its own iP. As the final element in 
the topic then, =e automatically falls at the right edge of iP. 


(42) o Topic X 
o V-O-S 
o Possessum - Possessor 


Basic word order in Tsotsil is usually described as VOS, with the subject in clause-final 
position. Transitive subjects (as well as active intransitive ones) are almost always defi- 


3 Languages with such markers include Wolof (Niger-Congo, Torrence 2013), Basque (Laka 1996), Angami 
(Tibeto-Burman, Giridhar 1980, cited in Dryer 2013), and Gaahmg (Nilo-Saharan, Stirtz 2012). 
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nite, so generally license =e. Unless the subject is followed by some other element (e.g., 
an adverb, a PP, an element in a matrix clause), =e again finds itself at the right edge of 
ıP. Finally, Tsotsil being a head-initial language, the possessor follows its possessum, as 
in (43). 


(43) L-i-bat ta [s-na [li Xunze]]. 
CP-ABS.l-go P GEN.3-house DET Juan-DEF 


‘I went to Juan's house? 


Possessors too are almost always definite, and often end up as the final phrase in an 
utterance. Here too, =e’s position at the right edge of iP is determined by the syntax. In 
all these cases then, =e is the last syllable in iP, the position associated with the prosodic 
peak. 

Of course, the phrase which licenses =e does not always occur utterance-finally — if 
it did, there would be no motivation for this paper. But in a fragment of written text 
containing 156 instances of =e, there were only three in which that phrase did not occur 
utterance-finally. In these cases, =e was separated from its licensing phrase, as in (12)-(17) 
above. Thus, if it is true that the position of =e was originally determined syntactically, 
it would nonetheless have had a statistical association with the phonological properties 
that characterize the prosodic peak of iP and reanalysis of this association as a lexical 
property would have resulted in the situation we see today. 


7 Conclusion 


This paper has attempted to lay out the case for Z Tsotsil =e as a special clitic - one 
whose surface position is not always a position it could have reached syntactically. If 
this is correct, the phonology does something here other than select the prosodically 
optimal position for =e from among the syntactically possible ones. It must achieve 
the effect of moving =e within a prosodically-defined domain. In the analysis proposed 
here, =e is not subject to an alignment constraint; rather, it ends up at the right edge of 
(D because it must function as the prosodic peak of iP, and that peak is located at the 
right edge of iP. Complying with this requirement sometimes involves reordering the 
enclitic over a fairly large distance. Since the reordering occurs in the phonology, it is 
not subject to syntactic locality. It is, though, subject to prosodic locality, as =e always 
remains within the iP that contains its licensor (fn. 12). 

Tsotsil =e thus appears to be different from the the second position clitics discussed in 
Bošković (2000) and Bermüdez-Otero & Payne (2011), clitics which can reach their sur- 
face positions by syntactic means. The difference might be understood in terms of the 
property which determines their surface position. The position of the second-position 
clitics of Chamorro and Serbo-Croatian is determined by a prosodic alignment condition. 
But prosodic constituency is introduced in the interface between syntax and phonology 
and is therefore present before the phonology proper. The placement of second-position 
clitics can therefore be determined prior to the phonology and without any involvement 
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of the phonology. On the other hand, if the analysis of Z Tsotsil =e suggested here is on 
the right track, its position cannot be determined until the phonology proper, since it is 
only in the phonology that the location of prosodic prominence within the iP is fixed at 
the right edge. In this light, the special clitic status of =e arises because the condition 
which makes it “special” - which forces it out of its licensing phrase - references a purely 
phonological property and not a prosodic edge. 


Abbreviations 

ASP aspect ICP incompletive aspect 
CL clitic P preposition 

CP completive aspect PRO pronoun 

DEF definite terminal clitic PAR particle 

DIR directional RR reflexive/reciprocal 
J existential predicate SS status suffix 


EXCLAM exclamatory particle 
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Chapter 13 


Another way around causatives in 
Chamorro 
Sandra Chung 


University of California, Santa Cruz 


In Anderson’s (1992) theory of a-morphous morphology, the traditional observation that 
inflection occurs “outside of” derivation follows from the assumption that only lexically 
complete stems can instantiate morphosyntactic representations. Anderson discusses an 
apparent counterexample to the traditional observation that involves causative verbs and 
number agreement in the Austronesian language Chamorro. Anderson defuses the apparent 
counterexample by proposing, following Durie 1986, that Chamorro number agreement is a 
derivational, rather than inflectional, process. I show that there is a different way of finessing 
the issue that preserves the intuition that Chamorro number agreement is inflectional. This 
alternative takes the causative ‘prefix’ to be a prosodically deficient verb, in the overall spirit 
of Anderson 2005. 


1 Introduction 


In Anderson’s theory of a-morphous morphology, the traditional observation that inflec- 
tion occurs “outside of” derivation follows from the assumption that only lexically com- 
plete stems can instantiate morphosyntactic representations. Anderson (1992: 127-128) 
discusses an apparent counterexample to the traditional observation from Chamorro, an 
Austronesian language of the Mariana Islands. Chamorro has causative verbs which, 
according to previous accounts, are formed by attaching the prefix na’- to a verb or ad- 
jective (see e.g. Baker 1985; Gibson 1980, Safford 1904: 108, and Topping & Dungca 1973: 
247-249). The point of interest is that na’- can attach to a verb or adjective that already 
shows number agreement. Assuming that na’- is derivational but number agreement is 
inflectional, the fact that na'- can occur "outside of" number agreement is problematic. 
Anderson defuses the apparent counterexample by proposing, following Durie (1986: 
364-365), that Chamorro number agreement is a derivational, rather than inflectional, 
process. 

Here I explore a different way of finessing the issue, one that preserves the intuition 
that Chamorro number agreement is inflectional. The key to this alternative is to take 
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the causative “prefix” to be a prosodically deficient verb, in the spirit of Anderson’s 2005 
approach to clitics as phrasal affixes. Chamorro has a small class of prosodically defi- 
cient verbs that are instances of Zwicky’s 1977 bound words. These forms have the mor- 
phosyntax of verbs, but cannot serve as phonological words on their own. Instead, they 
must remedy their prosodic deficiency by undergoing stray adjunction to the phonolog- 
ical word to their immediate right, which is always the first phonological word of their 
complement. 

I show that much of the distinctive profile of Chamorro causatives is accounted for if 
the causative na’ is treated as a prosodically deficient verb that selects a vP complement. 
Moreover, once this route is taken, Chamorro causatives no longer pose a threat to the 
"outside-inside" order of inflection and derivation, even if number agreement is inflec- 
tional. This is because the causative na' that can appear "outside of" number agreement 
is not, in fact, derivational morphology, but rather the prosodically deficient content of 
a syntactic verb. 

82 of this paper gives a mini-introduction to the morphosyntax of Chamorro clauses. 
§3 presents the basics of causatives and their interaction with the language's two types 
of subject-verb agreement. $4 looks closely at Durie's 1986 evidence that Chamorro 
number agreement is derivational and concludes that it is not decisive. Then, $5 gives an 
overview of Chamorro's prosodically deficient verbs. $6 proposes that the causative na' 
is a prosodically deficient verb and explores some positive consequences of this proposal. 
87 handles some challenges, and §8 concludes. 


2 Chamorro Morphosyntax in Brief 


Chamorro is a head-initial language that allows predicates of all major category types 
and a range of null arguments. When the predicate is a verb or adjective, the default word 
order of the clause is Predicate Subject Object Other, but the order of arguments and 
adjuncts after the predicate is flexible (see Chung 1998 and the references cited there)! 


(1) a. Ha babasi  Antonioi petta. 
P.AGR open UNM Antonio the door. 


‘Antonio opened the door: 
b. Kumati i neni sa’ ma’a’fiao ni sanye’yi’. 
N.AGR.cry the baby because N.AGR.afraid OBL spider 


"Ihe child cried because she's afraid of the spider? (CD, entry for sanye’yi’) 


DPs are case-marked with a proclitic that occurs to their immediate left. There are 
three morphological cases: unmarked, local, and oblique. Subjects, direct objects, pos- 
sessors, predicate nominals, the objects of most overt prepositions, and DPs that occupy 


1 Most of the Chamorro examples cited here are from the CD database, which consists of some 30,000 sen- 
tences constructed by Chamorros in the CNMI as illustrative examples for the revised Chamorro-English 
dictionary. Other examples are from published sources listed in the references; unattributed examples are 
from my fieldwork. 
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topic or focus position occur in the unmarked case, which is overtly realized only when 
the DP is a proper name. Otherwise, DPs that denote locations or goals occur in the local 
case; most other types of DPs occur in the oblique case. 


(2 Ma rikuknisa si Estherani finatton-fia gi hunta. 
P.AGR recognize UNM Esther OBL arrival-poss Loc meeting 


‘They acknowledged Esther for her coming to the meeting: (CD, entry for 
rikuknisa) 


Predicates that are verbs or adjectives show subject-verb agreement via forms that 
also indicate mood (realis vs. irrealis) and are sensitive to transitivity. There are two 
types of subject-verb agreement. Person-and-number agreement (glossed P.AGR in the 
examples) is realized via forms that could be analyzed as proclitics or prefixes, but are 
written as separate words in the Chamorro orthography; see the paradigm in (3)? In 
the realis mood, this type of agreement is found only on transitive verbs; in the irrealis 
mood, it is found on all verbs and adjectives. 


(3) Person-and-Number Agreement 


Realis Irrealis 
1sc hu (bai) hu / bai 
2sG un un 
3sc ha u 
1INCL.DU/PL ta (u)ta 
1EXCL.DU/PL in (bai) in 
2DU/PL en en 
3DU/PL ma u (INTR) / uma (TR) 


Number agreement (glossed N.AGR) is realized via a prefix or infix; see the paradigm 
in (4). This type of agreement is found only on intransitive verbs and adjectives. 


(4) Number Agreement 


Realis Irrealis 


SG/DU -um-/— — 
PL man- fan- 


? Chamorro has various standard and nonstandard orthographies (see Chung 1998: Appendix A). The or- 
thography used here, which was officially adopted by the CNMI legislature in 2010, differs in small ways 
from the transcription used in Chung 1998, and more substantially from earlier spelling systems, including 
the official orthography on Guam. 

3 The choice between the two realizations of realis singular number agreement is determined lexically. Gen- 
erally, -um- is used for event predicates, as well as for state predicates in the inchoative aspect; otherwise, 
the agreement is generally unrealized for state predicates. But there are exceptions. The realizations of 
plural number agreement have a final /n/ that undergoes the alternation known as nasal substitution. 
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Notice that dual is aligned with plural for the purposes of person-and-number agree- 
ment, but with singular for the purposes of number agreement. This will become impor- 
tant later. 

Both types of agreement are the default realizations of subject-verb agreement for 
predicates of the relevant type, and fully productive; e.g. they can be added to recently 
borrowed words (as in 5a), even when the borrowings are creative or innovative (as in 


5b). 


(5 a. Man-meeting ham gi Lunis. 
N.AGR-have.meeting we Loc Monday 
"We had a meeting on Monday: (CNMI Senate session SJ 17-22: 20) 
b. Bai hu “love-mark” i  kurason-mu. 
P.AGR love-mark the heart-Poss 


‘I will “love-mark” your heart. (EM 60) 


Finally, the two types of agreement have overlapping distributions. Transitive verbs 
show only person-and-number agreement (see 1a, 2, 5b and 6a); intransitive predicates 
in the realis mood show only number agreement (see 1b, 5a, and 6b); but intransitive 
predicates in the irrealis mood show both. Note that when the two types of agreement 
co-occur, person-and-number agreement occurs “outside of” — i.e. to the left of - number 
agreement (see 6c). 


(6) a. Hu  afuetsas gui’ parau atan yu’. 
P.AGR compel her FUT r.AGR look.at me 


‘I compelled her to (lit. that she would) look at me’ (CD, entry for afuetsas) 
b. Duránti-ni  tinaitai, bula mang-áti. 

during-L the prayer many N.AGR-cry. 

‘During the prayer, many cried’ (CD, entry for duranti) 


c. Ti parau fang-ati i  famalao’an. 
not FUT P.AGR N.AGR-cry the women 


‘The women are not going to cry. 


With this much in place, let us now zero in on causatives. 


3 Causatives 


Previous accounts describe Chamorro causatives as formed by adding the prefix na’- to 
a verb or adjective (see e.g. Baker 1985; Gibson 1980; Safford 1904; Topping & Dungca 
1973). This process creates a derived transitive verb with an additional argument, which 
denotes the causer. The causer argument is realized as the subject of the causative; the 
subject of the original predicate (henceforth the inner predicate) is realized as the direct 
object of the causative; and the direct object of the inner predicate, if any, is realized as 
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an oblique (see Gibson 1980). To illustrate, the causatives na’baba ‘make open’, na’kati 
‘make cry’, and na’ma’a@’fiao ‘make afraid, frighten’ are derived, respectively, from the 
transitive verb baba ‘open’ (cf. 1a), the intransitive verb kati ‘cry’ (cf. the first clause of 
1b), and the adjective ma’a’fiao ‘afraid’ (cf. the second clause of 1b). 


(7) a. In  na-baba si Antonioni petta. 
P.AGR CAUS-open UNM Antonio OBL door 
"We made Antonio open the door: 
b. Ha na’-kati si  Genei lahi-fa anai ha lalátdi. 
P.AGR CAUS-cry UNM Gene the son-Poss when P.AGR scold 
“Gene made his son cry when he scolded him’ (CD, entry for kati) 
c. Un na’-ma’a’fao yu’ ni taklalo'-mu. 
P.AGR CAUS-afraid me OBL great.anger-Poss 


“You made me afraid with your great anger? 


Gibson’s 1980 investigation of the syntax of Chamorro causatives established three 
points that will be in the spotlight here. First, causatives have the morphosyntax of the 
language’s transitive verbs (Gibson 1980: 86-91). Like other transitive verbs, they can 
occur in the passive ` 


(8) a. Ma-na’-gimin i patgun amut ni ti dinanchi. 
N.AGR.PASS-CAUS-drink the child | medicine comp not N.AGR right 
"Ihe child was made to drink medicine that was not right’ (CD, entry for 
tumaiguihi) 
b. Kulan nina’-ma’a’fiao i biha nu esti na klási-n tinanum. 
sort.of N.AGR.PASS.CAUS-afraid the old adv opt, thisr type-r plant 
"Ihe old lady was kind of made afraid by this type of plant? (MAK 2) 


They can also occur in the antipassive.? 


(9) Mu-nana’-gupu papalotisi Juanito gi kantu-n tasi. 
N.AGR-AP.CAUS-fly.PROG kite UNM Juanito Loc edge-L ocean 
"Juanito is flying a kite (lit. making a kite fly) by the seashore’ (CD, entry for 
na'gupu) 
And they can be used to create reciprocals - derived intransitive verbs, formed with 


the stressed prefix á-, which are Chamorro's primary means of expressing reciprocal 
meaning. 


^ Passive verbs are formed with the infix -in- or the prefix ma-. The choice between -in- and ma- is determined 
primarily by the number of the passive agent: -in- appears when the agent is singular, ma- when the agent 
is dual/plural or implicit (see Chung 1998: 38, note 8). 

5 Antipassive verbs are usually formed with the prefix man-/fan-. However, some transitive verbs have 
suppletive antipassive forms (e.g. the antipassive of kannu’ ‘eat’ is chotchu); others have antipassive forms 
identical to their transitive forms (e.g. gimin ‘drink’). The antipassive of a causative is formed by shifting 
primary stress to the causative prefix. 
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(10) Kao um-a-na’-patcha hamyu ni feggun? 
Q  N.AGR-RECIP-CAUS-touch you.PL OBL stove 


‘Did you two make each other touch the stove?’ 


Second, causatives can be derived from verbs that are morphologically complex (see 
Gibson 1980: 114-121). The causatives in the examples in (11) are derived from verbs - 
surrounded by brackets - that are passive (11a- 11b), antipassive (11c), and reciprocal (11d). 


(11) a. In  na'-[ma-baba] as Antonio. 
P.AGR CAUS-PASS-open OBL Antonio 


"We made it be opened by Antonio’ 
b. Bai na’-[sinaolak] hao nu i  ma'estra. 

P.AGR CAUS-PASS.spank you OBL the teacher 

‘T will let you be spanked by the teacher’ (CD, entry for sinaolak) 
c. I bakulu-hu ha-na’-[fang-ganna] yu’. 

the shooter.marble-Poss P.AGR-CAUS-AP-win me 

‘My shooter marble made me win’ (CD, entry for bákulu) 
d. Ma  na'-[á-dispatta] i dos táotao ni mumu. 

P.AGR CAUS-RECIP-separate the two person COMP NACH. Debt 


"Ihey separated (lit. caused to separate from each other) the two people who 
were fighting’ (CD, entry for na'ádispatta) 


As these observations might lead one to expect, causatives derived from morphologi- 
cally complex verbs can themselves occur in the passive, antipassive, or reciprocal. The 
verbs in boldface in (12) are the passive of a causative derived from a passive verb (in 
12a) and the passive of a causative derived from an antipassive verb (in 12b). 


(12 a. ..yanmasehaháyi malago’-fa i Lahi-fa parau 
andever who wH.want-poss the son-POSS FUT P.AGR 
nina’-[ma-tungu’] Gui. 
PASS.CAUS-PASS-know he 


`. and whoever his Son wants to cause Him (lit. that He be caused) to be 
known by: (NT 124) 


b. Nina -[faü-otsut] anai ma-nái máolik na kunseha. 
N.AGR.PASS.CAUS-AP-repent when N.AGR.PASS-give good £L advice 


‘She repented (lit. was caused to repent) when she was given good advice? 
(CD, entry for na’fanotsut) 


$ Although it is possible in principle for causatives formed from a verb in any voice to occur in any voice, 
the naturally occurring data suggest that some combinations are more frequent than others. When the 
causative is active transitive or passive, the inner predicate can be active (transitive or intransitive), passive, 
antipassive, or reciprocal. When the causative is antipassive or reciprocal, the inner predicate is most often 
active (transitive or intransitive). 
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A causative can even be derived from the passive of a causative, as (13) shows. 


(13) Si Josephine ha na’-[ma-na’-[suha]] i atgoya gi gui’eng-fia. 
UNM Josephine P.AGR CAUS-PASS-CAUS-go.away the nose.ring LOC nose-POss 


‘Josephine had her nose ring removed (lit. caused the nose ring to be caused to go 
away). (CD, entry for atgoya) 


Third, the inner predicate - the verb or adjective from which a causative is derived 
— does not show person-number agreement. But, surprisingly, the inner predicate does 
show number agreement (see Gibson 1980: 112-114). Inner predicates that are intransitive 
agree with the DP that would have been their subject via irrealis number agreement, 
which is unrealized in the singular/dual, but spelled out as the prefix fan- in the plural. 
This number agreement is not realized on the inner predicates in (11-13), because the 
DPs that would have been their subjects are singular/dual (e.g. the null pronoun ‘it’ in 
(11a), hao ‘you (sg.)’ in (11b), yu’ ‘me’ in (11c)), but it is overt on the inner predicates 
in (14), because the DPs that would have been their subjects are plural. (Note that the 
inner predicates in (14) are clearly not agreeing with the subject of the causative, which 
is singular.) 


(14 a Hu na’-[fang-gupu]i  petbus. 

P.AGR CAUS-N.AGR-fly the dust 
‘I made the (particles of) dust fly around. (CD, entry for na’gupu) 

b. Ha na’-[fan-luhan] ham. 
P.AGR CAUS-N.AGR-afraid us 
‘[The wind] scared us (lit. made us afraid)? (CD, entry for diripenti) 

c. Ha na’-[fan-ma-kotti] i  guátdia,ya ha  na’-[fan-ma-punu’]. 
P.AGR CAUS-N.AGR-PASS-try the guard and P.AGR CAUS-N.AGR-PASS-kill 


‘He had the guards brought to trial, and had them killed’ (NT 235) 


d. I abisu ha  na'-[fan-man-unungu'] i taotao na 
the alarm P.AGR CAUS-N.AGR-AP-know.PROG the person that 


"The alarm is letting the people know that...[the typhoon is close]. (CD, entry 
for abisu) 


Baker (1985) used the relative order of the plural fan- with respect to the causative 
and passive affixes to argue for the Mirror Principle. As he observed, "clear examples of 
agreement morphemes that can appear intermixed with GF-rule morphemes seem quite 
unusual" (Baker 1985: 386). What matters here is that the plural fan- in the examples 
in (14) occurs “inside of" — i.e. to the right of - the causative na’-. Assuming that fan- 
is inflectional but na’- is derivational, this ordering appears to counterexemplify the 
traditional claim that inflection always occurs "outside of" derivation. 
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4 Number agreement revisited 


A natural question to raise at this point is whether Chamorro number agreement might 
be derivational as well. 


4.1 Is it derivational? 


As Anderson (1992: 127-128) observes, this question is answered in the affirmative by 
Durie (1986), who contends that across languages, verbal number - whether realized by 
stem suppletion or productive affixation - is “selectional concord” (i.e. derivational) as 
opposed to "agreement". Durie's evidence for this claim comes from various languages, 
including Chamorro. In the suppletion cases he examines (in e.g. Huichol), verbal num- 
ber is sensitive to semantic roles like patient or affected participant, not to syntactic 
relations like subject. Chamorro number agreement does not conform to this pattern, 
but instead cross-references the (surface) subject regardless of semantic role; this is one 
way that it behaves like a paradigmatic case of agreement. Still, Durie argues that num- 
ber agreement in Chamorro is "inherent verbal Number morphology" (Durie 1986: 364) 
whereas person-and-number agreement is inflectional, on the basis of the following: 


e Number agreement distinguishes plural from nonplural (i.e. plural from singu- 
lar/dual), but the number feature on nouns and pronouns distinguishes singular 
from nonsingular (i.e. singular from dual/plural), so “[t]here is no [+plural] feature 
for the verb to agree with" (Durie 1986: 364). 


e Number agreement can have an overt pronoun as antecedent, whereas person- 
and-number agreement cannot. 


e Number agreement appears in infinitives, imperatives, and attributive modifiers, 
whereas person-and-number agreement does not. 


e Number agreement is preserved in lexical derivations, such as causatives (see 
above), whereas person-and-number agreement is not. 


These may look like good reasons for classifying number agreement as derivational 
- a move that would make it unsurprising in the extreme that the plural fan- can occur 
“inside of" the causative na’-. But further examination suggests a more equivocal picture. 


4.2 A second look 


Consider, to begin with, the claim that Chamorro nouns and pronouns have a differ- 
ent number feature than what is registered by number agreement. The specific claim 
is that nouns and pronouns employ the feature [+singular] - they distinguish singular 
from dual/plural - whereas number agreement employs the feature [+plural] - it distin- 
guishes singular/dual from plural (see the paradigm in (4)). Assuming that inflectional 
morphology is the spell-out of syntactic features, the disconnect between these features 
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might seem to pose an insuperable problem for the view that number agreement is in- 
flectional (but see below). 

Overt pronouns in Chamorro do indeed employ the feature [+singular] - they distin- 
guish singular from dual/plural, as observed explicitly by e.g. Safford (1903: 308). The 
second person independent pronouns hágu and hamyu, for instance, differ in that hagu 
refers to just one addressee, while hamyu refers to two or more addressees. The other 
overt pronouns are similar. It is less obvious how number is handled in nouns, because 
most Chamorro nouns do not show obligatory number inflection. Just a handful of nouns, 
listed in (15), are inflected obligatorily, and somewhat irregularly, for number. 


(15) a. Singular Dual Plural 
che’lu chumelu mañe’lu ‘sibling’ 
b. lahi lalahi ‘man, son’ 
palao’an famaláo'an ‘woman’ 
pali’ mamiali’ ‘priest’ 
patgun famagu’un ‘child’ 
saina mafiaina ‘parent’ 


The noun che'lu has separate forms for singular, dual, and plural. The other nouns 
have forms which are usually termed “singular” and “plural” (e.g. Safford 1903: 302-304, 
Topping & Dungca 1973: 325), but actually distinguish singular/dual from plural. That 
is, they employ the feature [+plural]. The examples in (16) reveal that when these nouns 
refer to just two individuals, they are realized in the singular/dual form, not the plural 
form. 


(169) a. Um-iskuekuela i dos patgun sanlagu. 
N.AGR-attend.school.proc the two child continental.US 


"Ihe two children are attending school in the continental U.S’ (CD, entry for 
sanlagu) 


b. Dos na palao’an u fang-gugulik trigu. 
twoL woman P.AGR N.AGR.AP-grind.PROG grain 


"Two women will be grinding grain’ (NT 48) 


The claim that the nouns in (15b) align dual with singular is supported by naturally 
occurring data.’ There are 30 instances in the CD database, and 23 instances in the first 
150 pages of the Chamorro New Testament (NT), of these nouns occurring in explicitly 
dual DPs - DPs whose noun is preceded by the numeral dos ‘two’. In 51 out of the 
combined 53 instances, the noun occurs in the singular/dual form. 

It is now clear that Chamorro pronouns employ the feature [+singular], but obligato- 
rily inflected nouns employ the feature [+plural] or - in the case of che'lu — both features. 


7 Native speakers' judgements trend in the same direction, but are more forgiving. For instance, when asked 
which of the following two forms she would use to refer to two children, one speaker commented that i 
dos patgun ‘the two children’ (with the singular/dual form of the noun) was better for her, but that i dos 
famagu'un (with the plural form of the noun) “will be understood in most circumstances". 


271 


Sandra Chung 


This makes it reasonable to suppose that Chamorro DPs are specified for [+singular] and 
[+plural], even though in the vast majority of cases, these features have no DP-internal 
realization. But then the way that number is handled by the agreement system is com- 
patible with the idea that both types of agreement are inflectional. Person-and-number 
agreement simply registers one of the number features (namely, [+singular]), while num- 
ber agreement registers the other ([+plural]) 

I now turn to Durie's other evidence that number agreement is derivational. It consists 
of the following: 

- Number agreement can have an overt pronoun as antecedent, but person-and-num- 
ber agreement cannot. (The only pronouns that can antecede person-and-number agree- 
ment are null pronouns; see also Chung 1998: 30-31.) Durie takes these facts, which 
are illustrated in (17), to show that person-and-number agreement is “anaphoric”, but 
number agreement is not. 


(17 a. Yayas (gui’). 
N.AGR.tired s/he 


‘S/he is tired: 


b. Ha fahan (*“gui’)i  lepblu. 
P.AGR buy s/he the book 


‘S/he bought the book’ 


Now, the contrast in (17) could ultimately reflect a difference between derivation and 
inflection. But it is equally likely that it flows from some linguistic notion of "efficiency" 
or “brevity” (cf. Grice) plus the featural content of the two types of agreement. Person- 
and-number agreement encodes exactly the same features as Chamorro pronouns - 
namely, person features and [+singular] - so a ban that prevents this type of agree- 
ment from being anteceded by an overt pronoun contributes to the goal of minimizing 
redundancy. A comparable ban on number agreement would have no rationale, because 
number agreement encodes a different feature — [+plural]. 

- Number agreement appears in infinitives, imperatives, and attributive modifiers, but 
person-and-number agreement does not. Consider the imperative in (18). 


(18) (En) Fan-man-hokka’ sa’ bula pineddung mángga gi egga'an. 
P.AGR N.AGR-AP-pick because N.AGR.many fallen.. ` mango roc morning 


‘Go and do some picking, because there were many fallen mangos in the 
morning. (CD, entry for poddung) 


To the extent that this observation is valid, it could bear on the contrast between 
derivation and inflection, but other explanations are possible. Suppose, for instance, that 
number agreement realizes a feature of small v, whereas person-and-number agreement 


8 In conjoined imperatives, the leftmost imperative verb does not show person-and-number agreement, but 
verbs in subsequent conjuncts generally show irrealis person-and-number agreement as well as number 
agreement (if applicable). The embedded “clause” in restructuring constructions can either be inflected like 
an infinitive or show realis person-and-number agreement; see 6.2. 
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realizes features of T. Then number agreement would be expected to appear in infinitives 
and imperatives, because these constructions are at least vPs; there might be no similar 
expectations for person-and-number agreement. I will adopt a version of this approach 
below. As for attributive modifiers, it should be noted that Chamorro allows relative 
clauses to precede or follow the head NP; it also allows relative clauses whose head NP 
is null (see Borja, Chung & Wagers 2015). The attributive modifiers that show number 
agreement can straightforwardly be analyzed as predicates of one or another of these 
relative clause types. 

- Finally, number agreement is claimed to be preserved in lexical derivations, such 
as causatives and what Durie calls “nominal derivatives”. Causatives are, of course, the 
focus of investigation here. The “nominal derivatives” are not, in fact, derived nouns 
but rather relative clauses whose head NP is null. Two of Durie’s examples are given 
below, with the spelling normalized. In these constructions, the word that shows number 
agreement is the verb of the relative clause, which happens to be intransitive. 


(1) a. i  humanao 
the N.AGR.go 
‘the (one) who went’ (translated by Durie as ‘the goer’) 
b. iman-hanao 
i N.AGR-gO 
‘the (ones) who went’ (translated by Durie as ‘the goers (> 2)’) 

Notice that when the verb of the relative clause is transitive, it can show person-and- 
number agreement; see the relative clauses in brackets below.’ This too is expected if 
these constructions are relative clauses. 

(20) a. Abansa [i un chochogui]. 
advance the p.AGR do.PROG 
‘Go forward with the (thing) which you are doing (CD, entry for abánsa) 


b. Hu angokkuna  paraun cho’gui[i hu faisin hao]. 
P.AGR trust COMP FUT P.AGR do the P.AGR ask you 


‘T trust that you will do the (thing) which I ask you’ (CD, entry for angokku) 


In the end, the evidence cited by Durie provides no firm basis for classifying number 
agreement as derivational or inflectional. But then we are back to the original conun- 
drum: why can the plural fan- occur “inside of” the causative na’-? I propose to answer 
this question by analyzing the causative na’ not as a derivational prefix, but as a prosod- 
ically deficient verb. 


5 Prosodically deficient verbs 


The proposal to analyze the causative na’ as a prosodically deficient verb assimilates it to 
avery small class of frequently used Chamorro verbs. This class contains the intransitive 


? The verb of the relative clause can also show wh-agreement, but that is irrelevant here. 
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verb malak/falak ‘go to, head to, depart for’ and the transitive verb fa’ ‘pretend’.’” Both 
verbs are clearly the content of lexical categories; they are not derivational prefixes. Like 
other verbs, they serve as the predicates of clauses, show subject-verb agreement, are 
inflected for mood and aspect, and so on. More significantly for our purposes, they select 
a functional projection as their complement. 

Malak/falak ‘go to’ selects a DP that is linked to its goal argument. This DP, which is 
bracketed in (21), can include determiners (see 21a) and modifiers (21a-21b); it can also 
consist of a place name (21c) or an interrogative pronoun (21d). This range of expansions 
reveals that syntactic incorporation, however analyzed, is not involved. 


(21) a. Man-malak [i Pala na kasinu] ham. 
N.AGR-go.to the Palat casino we 
"We went to the Pala casino’ (CD, entry for kasinu) 

b. Ti ya-hu malak [ottru tanu’]. 
not like-poss N.AGR.INFIN.go.to other land 
‘I don't like to go to other places. (CD, entry for gástu) 

c. Yanggin gaigi haoSaipanya paraun  falak [Tinian], siempri 
if N.AGR.be.at you Saipan and FUT P.AGR N.AGR.go.to Tinian indeed 
humanao hao luchan. 

N.AGR.go you south 
‘If you are on Saipan and traveling to Tinian, you will have to go south’ (CD, 
entry for luchan) 

d. Malak [manu] hao nigap? 

N.AGR.go.to where? you yesterday 


"Where did you go yesterday?' (CD, entry for malak) 


Fa’ ‘pretend’ selects a finite realis TP complement. This embedded TP can have a 
predicate of any major category type, and when the predicate is a verb or adjective, it 
shows subject-verb agreement, as expected. 


(22 a In fa [in tingui ti un tungu]. 
P.AGR pretend P.AGR know the not ».AGR know 


"We (excl.) pretend we know what you don't know: (from a conference 
speech) 


10 I represent these verbs without dashes in order to highlight the fact that they are not prefixes. Note that 
malak/falak is an m/f verb; its initial consonant is realized as /m/ in the realis mood or when preceded by 
plural number agreement, but as /f/ otherwise. Fa’ is, confusingly, homophonous with a prefix fa’- that 
creates derived verbs meaning ‘make (into, with)’. This prefix attaches productively to nouns (e.g. fa'hànum 
‘liquefy’, from hánum ‘water, liquid’; fo denn?" ‘prepare with hot sauce’, from donni’ ‘hot pepper’), and less 
productively to adjectives (e.g. fa’baba ‘deceive’, from baba ‘bad’; fa’tinas ‘make’, from tunas ‘straight’). 
The verb fa’ ‘pretend’ and the derivational prefix fa’- are treated as the same affix by Topping & Dungca 
(1973: 176-77). 
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b. Ma tutuhun ma  fa' [man-kubátdi siha]. 
P.AGR begin ` P.AGR pretend N.AGR-cowardly they 


‘They began to pretend that they were afraid’ (NT 343) 


c. Ha fa [sen-metgut gui]. 
P.AGR pretend N.AGR.extremely-strong he 
‘He pretended to be extremely strong: 

d. Ha fg [i anghit gui] si Juan sa gaigi i 
P.AGR pretend the angel he unm Juan because w.AGR.be.at the 
nobia-fia. 
girlfriend-Poss 


John is acting like an angel (lit. pretending he is the angel) because his 
girlfriend is here’ (CD, entry for ánghit) 


The distinctive property of these verbs is that they are prosodically deficient; they are 
not phonological words and cannot bear primary stress. They remedy this deficiency by 
undergoing stray adjunction to the phonological word to their immediate right, which 
(in Chamorro) is always the first phonological word of their complement.!! In (21c), for 
instance, stray adjunction attaches falak to the phonological word Tinian (as shown in 
(23a)); in (22a), it attaches fa’ to the phonological word in tingu’, which itself consists 
of an agreement proclitic adjoined to the phonological word tingu’ ‘know’ (as shown in 


(23b)). 


(23) a. Q b. Q 
a SS 
falak œ fa ow 
| px 
Tinian in o 


tingu' 


Morphophonological processes which affect verbs, but whose domain is the phonolog- 
ical word, cannot affect a prosodically deficient verb directly. Instead, they must target 
the phonological word that immediately dominates it. In Chamorro, for instance, the 
progressive aspect is realized via reduplication of the primarily stressed CV of the pred- 
icate. When malak/falak or fa' occurs in the progressive, the CV that is reduplicated is 
the primarily stressed CV of the phonological word that immediately dominates them 


H In prosodic theory, stray adjunction is the operation that incorporates elements that are not parsed as 
prosodic units at a given level of prosodic structure into an adjacent prosodic unit at that level; see e.g. 
Anderson 2005: 13. The text assumes that in the cases under discussion, stray adjunction literally produces 
an adjunction structure. As Nick Kalivoda observes, another possibility is that a prosodically deficient verb 
simply becomes a daughter of the phonological word to its immediate right. 
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(which is also the primarily stressed CV of the phonological word to which they are 
adjoined). See (24). 


(24) a. Siempri [malak i tetenda] yu’. 
indeed N.AGR.go.to the store.PROG I 


‘I will definitely be going to the store? 


b. Ha [fa mudodoru] ha’ gui’. 
P.AGR pretend N.AGR.stupid.PROG EMP he 


‘He is just pretending that he is stupid’ (CD, entry for mudoru) 


The same holds true for other processes that are sensitive to prosodic structure. Among 
the overt pronouns of Chamorro are a set of weak pronouns which are second position 
clitics (e.g. yw’ T, hao ‘you (sg.)’). These weak pronouns occur right after the first phono- 
logical phrase of the intonational phrase corresponding to their clause (see Chung 2003). 
Because most Chamorro clauses have predicates that are verbs or adjectives, and most 
verbs or adjectives are phonological words that project a phonological phrase of their 
own, a weak pronoun is usually positioned right after them (see e.g. 21c). But when the 
verb is prosodically deficient, a weak pronoun is - as expected - positioned right after 
the phonological word (and phonological phrase) dominating it. The relevant phonolog- 
ical word is enclosed in brackets below. 


(25) a. Tátnai [malak Luta] yu’. 
never N.AGR.go.to Luta I 
Tve never been to Rota’ (CD, entry for tátnai) 
b. [Ha fa gof-maolik] gui’ na taotao. 
P.AGR pretend N.AGR.very-good he L person 


‘He pretended to be a very good person’ (CD, entry for fa’) 


6 Causative na’ as a prosodically deficient verb 


The preceding should be enough to suggest why it would be helpful to reanalyze the 
causative na’- as a prosodically deficient verb. Then the exuberance of its interplay with 
voice, agreement, and the like can be attributed to the fact that it combines morphosyn- 
tactically with the material on its left, but merely prosodically with the material on its 


right. 


6.1 Proposal 


I propose to flesh out the details of this reanalysis as follows. Suppose that instead of a 
causative prefix na’-, Chamorro has a prosodically deficient verb na’ ‘make, let, cause’, 
which selects a small clause complement - specifically, a vP complement. In Chamorro, 
small v selects a complement that is VP or AP, so the verb na’ will occur in syntactic 
structures of the type shown in (26) (with specifiers omitted for convenience). 
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(26) a. VP b. VP 
P VER 
V vP V vP 
NEZ | 
na v VP na v AP 


The V or A that heads the embedded VP or AP in (26) corresponds to what was re- 
ferred to earlier as the inner predicate. Because the inner predicate has small v in its 
functional layer, and small v is responsible for voice, the inner predicate can be a verb 
that is passive, antipassive, or reciprocal. At the same time, when the verb na' occurs as 
the predicate of a finite clause, it will project its own small v (not represented in (26)), so 
it can independently be passive, antipassive, or reciprocal. This will account for much of 
the exuberant interplay that causatives exhibit. 

What about subject-verb agreement? In Chamorro, person-and-number agreement is 
always realized to the left of number agreement. This makes it reasonable to suppose that 
the two types of agreement spell out features of different functional heads, where the 
head whose features are spelled out by person-and-number agreement is the higher of 
the two. Now, word order aside, finite clauses in Chamorro have a familiar architecture 
in which the functional layer of the clause contains (at least) T and small v (see Chung 
1998; 2004). Let us assume, then, that T is specified for finiteness, mood, aspect, and the 
person and number of the DP in its specifier (= the subject). The relevant number feature 
here is, of course, [singular]. These features of T are spelled out by person-and-number 
agreement when the predicate is transitive or the mood is irrealis; see (3). Let us make 
the further, more interesting assumption that small v is specified for the number of the 
DP in its specifier via the feature [plural]. This feature of v is spelled out by number 
agreement when the predicate is intransitive; see (4). In the finite clauses of interest 
here, T has a vP complement, the DP in vP's specifier raises to the specifier of T, and 
number agreement spells out some features of T (finiteness and mood) as well as the 
number feature of small v. The mechanisms responsible for the multiple exponence of 
finiteness and mood are irrelevant here. What matters is that in structures in which vP 
is the complement of the verb na’, number agreement is spelled out with “irrealis” forms: 
as the prefix fan- when the DP in small v’s specifier is [+plural], and with no realization 
otherwise. 

Let us now turn to the prosody. The verb na' is prosodically deficient, so in the 
prosodic structure corresponding to (26) it will undergo stray adjunction to the phono- 
logical word to its immediate right, which is always the first phonological word of its 
complement. Assuming - crucially - that the word order ofthe small clause complement 
has already been determined, this phonological word will be the content of a verb or ad- 
jective. The verb or adjective may be morphologically complex and may begin with the 


12 A reviewer asks how transitivity is folded into the picture. I assume that T's features are spelled out as 
person-and-number agreement when T shares features with transitive small v - a small v that assigns 
abstract Case. Small v's number feature is spelled out as number agreement when small v does not assign 
abstract Case. See 7.2. 
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plural fan-. In other words, stray adjunction will lead to one of the outcomes schematized 
in (27). 


(27) a. Q b. Q 
/N AN 
na o na o 
| 
fan-...... 


Overall, this proposal gives a remarkably successful account of the morphosyntac- 
tic profile of Chamorro causatives presented in 83. Causatives have the morphosyntax 
of transitive verbs (see (8-10)) because na’ is, in fact, a transitive verb. The prosodic 
deficiency of this verb makes it appear to be a prefix - and therefore derivational morph- 
ology - but the appearance is illusory. Like other prosodically deficient verbs, the verb 
na’ selects a complement that is a functional projection - namely, vP - and undergoes 
stray adjunction to the phonological word to its immediate right, which is always the 
first phonological word of its complement. For independent reasons, this word is always 
the content of a verb or adjective (see (7)). The verb or adjective projects a vP in the 
syntax, so it is inflected for number agreement (see (14)) and can be morphologically 
complex (see (11-12)). Moreover, the claim that na' is a verb, as opposed to derivational 
morphology, makes it unsurprising that it can be the content of both the main verb and 
an embedded verb in recursive structures like (13). 

Note, finally, that the proposal is consistent with the way that na' interacts with 
morphophonological processes whose domain is the phonological word or phonologi- 
cal phrase. When na' occurs in the progressive aspect, the CV that is reduplicated is the 
primarily stressed CV of the phonological word that immediately dominates it (see Gib- 
son 1980: 79-81).? (For consistency, I continue to use the parsing and glossing conven- 
tions adopted earlier for causatives, even though na' is now analyzed as a prosodically 
deficient verb.) 


(28) a. Esta [nina’-chachatkuentus] ni malangu-fa. 
already N.AGR.PASS.CAUS-speak.incoherently.PRoG OBL sickness-Poss 


‘Her sickness is making her speak incoherently’ (CD, entry for chátkuentus) 
b. Hu ripárana un  [na'-malilisia] mampus i  palabrák-ku. 
P.AGR notice COMP P.AGR CAUS-malicious.PROG too.much the word-Poss 


'I noticed that you really are making my words malicious. (CD, entry for 
malisia) 


Further, weak pronouns are positioned not immediately after na’, but right after the 
phonological word (and phonological phrase) that dominates it. 


3 The progressive aspect in these examples must be interpreted as affecting the causative na’; it cannot be 
interpreted as affecting the inner predicate of the causative. See especially (28b). 


278 


13 Another way around causatives in Chamorro 


(29) [Man-na’-hanao] hamábiu parai  man-disgrasiào. 
N.AGR-AP.CAUS-go we support for the N.aGR-in.accident 


"We sent help for those involved in that accident’ (CD, entry for abiu) 


This is what we expect from a prosodically deficient verb; see §4. 


6.2 Consequences 


If this new approach turns out to be correct, Chamorro causatives no longer provide a 
counterexample to the traditional observation that inflection is *outside of" derivation. 
Instead, the causative na' is a prosodically deficient verb, and its relative order with 
respect to morphology which it happens to be prosodically attached to, but which be- 
longs morphosyntactically with a different predicate, is immaterial. The result stands 
even if Chamorro number agreement is taken to be inflectional, as in 6.1. This is why we 
embarked on the investigation in the first place. 

Further, and interestingly, the small clause complement of the verb na' turns out to 
fill a gap in the paradigm of Chamorro complementation. As might be expected, the 
language has various types of clausal complements, including finite clauses, infinitive 
clauses, and the embedded "clause" of restructuring constructions. Finite clauses and 
infinitive clauses are clearly TPs. They differ in that finite clauses are specified for mood 
and can have an overt subject, whereas infinitive clauses are mood-invariant and cannot 
have an overt subject (see Chung 1998: 64-68). Embedded “clauses” of restructuring 
constructions are similar to infinitive clauses in these respects, but smaller (see Chung 
2004). Given the claim in 6.1 that person-and-number agreement realizes features of T, 
these embedded "clauses" are best analyzed as defective TPs, as proposed by Bhatt 2005 
for Hindi-Urdu - TPs whose head is parasitic on the T of the clause under which they 
are embedded. 

The three types of clausal complements just described show number agreement and 
some person-and-number agreement. Finite clauses make full use of the agreement para- 
digms in (3-4). Infinitive clauses show realis number agreement when their predicate 
is intransitive and the invariant infix -um- when it is transitive. Embedded "clauses" of 
restructuring constructions show realis number agreement when their predicate is in- 
transitive and either realis person-and-number agreement, or the infix -um-, when it is 
transitive. 

If na' truly is a verb, then its small clause complement differs from the other types 
of clausal complements just mentioned along all of these dimensions. The small clause 
complement of the verb na’ is merely a vP - even smaller than the embedded “clause” of 
restructuring constructions - but it can have an overt subject. And, because it is merely 
a vP, it shows (irrealis) number agreement but no person-and-number agreement at all" 


Interestingly, Chamorro has at least one other verb that can select a vP complement: the imperative verb 
cha’- ‘don’t, shouldn't, better not’. As expected, the vP complement of cha’- (a) does not show person-and- 
number agreement, but (b) when intransitive, does show irrealis number agreement. Less expectedly, the 
specifier of this vP is always controlled PRO, and the verb or adjective from which vP is projected must be 
inflected for progressive aspect. Thanks to Pranav Anand for questions that uncovered this. 
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None of this language-particular fine detail is theoretically necessary or even expected, 
of course. But it is reassuring that the vP complement posited by our alternative ap- 
proach to causatives can be integrated smoothly into the overall picture of complemen- 
tation in Chamorro. 


7 Other aspects of the morphosyntax of causatives 


Other aspects of the profile of causatives present more of a challenge to the proposal 
just outlined. I discuss two such aspects below, with the aim of showing that they can be 
handled relatively straightforwardly once the right infrastructure is installed. One set of 
facts involves wh-agreement; the other involves morphological case. 


7.1 Wh-agreement 


When a constituent undergoes wh-movement in Chamorro, the verb or adjective on 
which it depends shows a special morphological agreement called wh-agreement (see 
Chung 1998 and the references cited there). This special agreement, which supersedes 
the normal forms of normal subject-verb agreement, signals the grammatical relation of 
the wh-trace - whether it is a subject, direct object, or oblique. For instance, when the 
wh-trace is a direct object, wh-agreement is realized by the infix -in- and nominalization 
ofthe verb. Nominalization is indicated, among other things, by the fact that the subject 
is cross-referenced by (suffixal) possessor agreement (glossed poss) rather than subject- 
verb agreement. Compare the sentence in (30a), in which the verb shows person-and- 
number agreement, with the constituent question in (30b), in which the verb shows 
wh-agreement. 


(30) a. Hu kakannu’i  gollai. 
P.AGR eat.PROG the vegetable 


'T m eating vegetables. (CD, entry for nos) 


b. Háfa kinannono mu? 
what? WH.eat-POSS.PROG 


"What have you been eating?' (from a tape-recorded narrative) 


In earlier work I analyzed wh-agreement as the result of feature sharing in abstract 
Case between a wh-trace and the T that most immediately commands it. The shared 
Case feature is then spelled out on the verb or adjective that projects T's complement. 
I will adopt this analysis here, noting that in minimalist syntax, abstract Case is often 
reconfigured in terms of the syntactic head that licenses the relevant DP via Agree. 

Let us now turn to causative sentences and consider the DP described at the beginning 
of 83 as the subject of the inner predicate. The proposal we are exploring treats this DP 
as the subject of the small clause complement of na’ — in other words, as the specifier of 
the embedded vP in the schematic diagram below. (This specifier is represented to the 
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right for convenience; see Chung 1998 on the derivation of predicate-first word order in 
Chamorro.) 


(31) v 


na wi DP 


zx 


V2 VP 


The small clause subject is in an ECM configuration, so it is licensed by Agree with 
the small v that immediately commands na’ (= vi in the diagram in (31)) in essentially 
the same way as if it were the direct object of na’. This licensing is confirmed by wh- 
agreement: when the small clause subject undergoes wh-movement, wh-agreement sig- 
nals that the wh-trace is a direct object (see Gibson 1980: 82, 164).^ 


(32) a Hayi na pilotu nina’-bastam-mu? 
who?r pilot wH.caus-quit-Poss 
"Which pilot did you fire (lit. make quit)?’ 
b. Ha  na'-moderátu si Liliani  [nina'-maipen-fia] hanum. 
P.AGR CAUS-moderate UNM Lillian the wH.cAus-hot-poss water 


‘Lillian moderated (the temperature of) the water that she was making hot’ 
(CD, entry for moderátu) 


Next, consider structures in which the inner predicate is transitive and so the small 
clause complement of na’ contains a direct object. This embedded direct object is licensed 
by Agree with the small v that immediately commands the inner predicate (= v; in (31)). 
Therefore, when it undergoes wh-movement, wh-agreement signals that the wh-trace is 
a direct object (see Gibson 1980: 197). 


(33) Háfa ninali'e-fia si Marianu hagu? 
what? WH.CAUS-see-POSS UNM Maria OBL you 


"What did Maria show you (lit. cause you to see)?’ 


Not only does wh-agreement register the same Case feature for both types of wh- 
traces, but in both constructions the verb on which the agreement is realized is the 


15 In (32b), the construction of interest is a prenominal relative clause (in brackets), and what has undergone 


wh-movement is — depending on one’s assumptions - either a null relative operator or else the head NP 
hánum ‘water’. 
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higher verb, namely, the causative na’. It is this verb that is infixed with -in- and un- 
dergoes nominalization, as can be seen from the fact that its subject (the causer) is the 
DP cross-referenced by possessor agreement. It may seem surprising that wh-agreement 
is realized on the higher verb, given that the wh-traces in (32) and (33) are arguments 
of the inner predicate. But the pattern follows from the syntactic structure proposed for 
causatives in §6.1, plus the independently motivated assumption that wh-agreement in- 
volves feature sharing between the wh-trace and T. Because small clauses do not contain 
T, a wh-trace in the small clause complement of a causative must share its abstract Case 
feature with the matrix T. As usual, the shared Case feature is spelled out on the verb or 
adjective that projects T’s complement, which in this case is the causative na’. 

It may seem even more surprising that the possessor agreement that ought to be real- 
ized on the nominalized verb is spelled out on what is apparently the inner predicate. I 
contend that what lies behind this unusual spell-out is the prosodic deficiency of na’. In 
Chamorro, affixes must attach to phonological words. This point emerges most clearly 
for suffixes, perhaps because suffixation invariably causes primary stress to shift to the 
penultimate syllable of the suffixed word. Since na’ is not a phonological word, but 
rather prosodically deficient, the suffix that realizes possessor agreement must attach 
instead to the phonological word immediately dominating it - the phonological word 
formed by stray adjunction of na’ to the inner predicate. This, I claim, is responsible for 
the unusual location of possessor agreement in (32-33). 

One might wonder how the same facts would be handled by a more traditional anal- 
ysis of Chamorro causatives that treats na’- as a derivational prefix. Such an analysis 
could deal straightforwardly with the spell-out facts just described, because it takes 
the combination of na’- plus the inner predicate to be a complex word (and therefore 
a phonological word). It would, however, have more trouble with the evidence provided 
by wh-agreement that both the subject and direct object of the inner predicate are li- 
censed by (different instances of) small v. This is because the more traditional analysis 
assumes that there is just one verb, and therefore just one small v, in the structure. 

It should be noted that Chamorro has no double object verbs — no verbs whose small v 
licenses more than one DP as a direct object. Verbs of transfer, for instance, have just one 
DP that activates the object form of wh-agreement when it undergoes wh-movement - 
namely, the DP that realizes the theme (not the DP that realizes the goal; see Gibson 1980: 
161-163). What this means is that a more traditional analysis of Chamorro causatives will 
have to stipulate that the derived causative verb, exceptionally, has two arguments that 
activate this form of wh-agreement. But no such stipulation is needed in the small clause 
analysis of this construction, as we have just seen. 


16 A reviewer asks if -in- infixation might target the phonological word containing the relevant verb. It might 
indeed. However, what matters here is that infixation does not target the phonological word consisting 
only of the inner predicate (which, recall, is distinct from the phonological word consisting of the inner 
predicate plus na’). This can be seen from the ill-formedness of *na’-lini’e’-ña as opposed to nina’-li’e’-fia 
'she caused to see'. More generally, it is hard to locate Chamorro evidence that prefixes and infixes must 
attach specifically to phonological words (as opposed to just any phonological material). 
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7.2 Morphological case 


I mentioned in 82 that Chamorro has three morphological cases - unmarked, oblique, 
and local - and that subjects and direct objects occur in the unmarked case. We must 
now confront the fact that a causative sentence has just two DPs in the unmarked case: 
the subject of na' and the subject of the inner predicate. The direct object of the inner 
predicate, if there is one, occurs in the oblique case. See the examples below. 


(34) a. Hu  na'-ayao si  Isidroni kareta. 
P.AGR CAUS-borrow UNM Isidro OBL car 


‘I let Isidro borrow the car? (CD, entry for ayao) 


b. Maila’ ya baihuna-lii haoni cha'kagi kodu-mu. 
come and P.AGR CAUS-see you OBL rat LOC arm.muscle-Poss 


“Come and let me show you (lit. I will make you see) the rat in your arm 
muscle: (CD, entry for cha’ka) 


This pattern raises a question. Given the wh-agreement evidence that the subject and 
direct object of the inner predicate are licensed in the same way (by a small v), why do 
these DPs occur in different morphological cases? 

In minimalist syntax, one way of resolving disconnects between morphological case 
and morphological agreement is to take case to reflect some mechanism other than li- 
censing by a syntactic head. The mechanism usually invoked is case competition (also 
known as dependent case assignment; see Marantz 1991 and many others since). The 
leading idea behind case competition is that if two DPs are in the same local domain, 
independent of each other, and not already case-marked, the presence of one DP will 
cause the other DP to be assigned case. 

Baker (2015) develops a theory of structural case in which various aspects of case com- 
petition are parameterized, including the local domain in which the DPs occur and the 
specifics of the c-command relation holding between them. In his theory, dependent 
case can be assigned in two local domains, VP and TP, which are the spell-out domains 
of phases. Significantly for our purposes, the evidence that VP is a local domain comes, 
in part, from causative sentences in Chamorro. Baker (2015: 137-139) assumes that Cha- 
morro causatives are morphologically complex verbs, and therefore causative sentences 
have a single VP that contains the complex verb's direct object (= the subject of the inner 
predicate) and can contain another DP (- the direct object of the inner predicate). The 
dependent case assignment that he proposes for Chamorro is essentially as follows. 


(35) Baker (2015) on dependent case assignment in Chamorro 


a. Suppose DP, has not been marked for case. If DP; is cccommanded by DP; 
and both are in the VP domain, assign DP, oblique case; 


b. Otherwise, assign DP; unmarked case. 
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As he observes, this case assignment handles the distribution of oblique versus un- 
marked case in causative sentences as well as other clause types (e.g. clauses constructed 
from verbs of transfer). 

Obviously, this proposal does not mesh well with the analysis of Chamorro causatives 
being explored here. The small clause structure I proposed in §6.1 for causatives locates 
the subject and the direct object of the inner predicate in different VP domains (see (31)); 
this will prevent dependent case assignment from occurring. However, Baker’s theory 
of case allows structural case to be assigned under Agree or through case competition, 
and this suggests other options. 

Baker takes unmarked case to be the default case in Chamorro. Suppose we take 
the opposite position and declare oblique case to be the default case. Then the task 
becomes to assign unmarked case to the various types of Chamorro DPs that exhibit 
it.18 Among the DPs that occur in the unmarked case are subjects, possessors, and DPs 
in topic/focus position. These DPs are the specifiers of the functional heads T, D, and 
C, which license them via Agree (see Chung 1998). Moreover, each licensing relation 
gives rise to some type of morphological agreement: person-and-number agreement (for 
subjects), possessor-noun agreement (for possessors), or operator-C agreement (for DPs 
in topic/focus position). All this suggests that unmarked case is assigned to these DPs 
under Agree. 

Direct objects also occur in the unmarked case, where the “direct object” of a causative 
sentence is the inner predicate’s subject but not the inner predicate’s object. Since direct 
objects - including the inner predicate's object - are licensed by transitive small v via 
Agree, the obvious move is to try to get their case to follow from a more limited version 
of that relation. I claim that unmarked case is assigned to these DPs under Agree, but 
only when transitive small v is selected by T. The italicized extra requirement may look 
stipulative. But there is evidence from several areas of Chamorro grammar that feature 
sharing occurs between small v and the T that selects it. Number agreement spells out 
not only the number feature of small v but the finiteness and mood features of the T 
that selects it (see $6.1). Further, the morphological operations responsible for person- 
animacy effects in Chamorro require that this feature sharing extend to person and other 
features of the DPs licensed by these heads (see Chung 1998; 2014). This feature sharing 
can be achieved in multiple ways which, frankly, are not of particular interest. What 
is relevant is that case assignment to direct objects can now be understood as follows: 
unmarked case is assigned by transitive small v under Agree, but only when it shares 
features with T. 

This achieves the desired outcome. In causative sentences, the subject of the inner 
predicate will be assigned unmarked case, because it is licensed in an ECM configuration 
by a small v (= v; in (31)) that shares features with T. But the direct object of the inner 


17 The local case does not enter into the picture, because it is not a structural case. 

18 The unmarked case is also used for predicate nominals and the objects of most overt prepositions. The 
oblique case is also used for various DPs treated in Chung 1998 as objects of null prepositions: passive 
agents, instruments, and DPs that realize the complements of antipassive verbs, other intransitive predi- 
cates, and nominalized verbs. It is unclear whether the proposals for case assignment in the text can, or 
should, be extended to these other uses. 
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predicate will not be assigned unmarked case, because it is licensed by the embedded 
small v (= v2 in (31)), which does not enter into a feature sharing relation with T. This 
DP will instead be assigned oblique case by default. 

The Agree-based case assignment that I have just proposed is summarized below. 


(36) Agree-based case assignment in Chamorro 


a. Assign unmarked case to DP if it is licensed by T, D, C, or by a transitive 
small v that shares features with T; 


b. Otherwise, if DP has not been marked for case, assign oblique case. 


This case assignment handles the distribution of unmarked versus oblique case in caus- 
ative sentences as well as clauses constructed from verbs of transfer and various other 
clause types. In other words, it has the same empirical coverage as Baker's dependent 
case assignment for Chamorro, but does not require causatives to be analyzed as com- 
plex verbs.? A more sustained comparison of the two approaches to Chamorro case 
assignment is better left for another time. My goal here is merely to show that it is pos- 
sible to give a coherent description of morphological case in causative sentences within 
the small clause analysis I propose. 


8 Conclusion 


Chamorro has many types of inflectional material that could perfectly well be analyzed 
as affixes or clitics; for instance, the material that realizes person-and-number agree- 
ment (and - conceivably - even the material that realizes number agreement). I hope to 
have shown here that the same freedom of analysis, when extended to material that is 
apparently derivational, can have thought-provoking theoretical consequences. 


Abbreviations 
AP antipassive P.AGR  person-and-number 
CAUS causative agreement 
comp complementizer PASS passive 
EMP emphatic POSS possessor agreement 
FUT future PROG progressive 
INFIN  infinitive Q question 
LOC local RECIP reciprocal 
N.AGR number agreement UNM unmarked 
OBL oblique WH wh-agreement 


19 Mark Baker (personal communication) observes that a dependent case account of Chamorro morphological 
case can be maintained if the VP embedded under na’ is what he calls a ’soft phase’. For reasons of space, 
the details are not spelled out here. 
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Sources for the Examples: 

CD CD database: The database for the revised Chamorro-English dictionary. 
Saipan, CNMI and University of California, Santa Cruz 

EM Borja, Joaquin F., Manuel E Borja, and Sandra Chung. 2006. Estreyas Mari- 
anas: Chamorro. Saipan, CNMI: Estreyas Marianas Publications. 

MAK Marciano, Dolores. n.d. Mannge’ na Aláguan Kalamása. National Dissemina- 
tion and Assessment Center, CSULA 

NT 2007. Nuebo Testamento [The Chamorro New Testament]. Diocese of Chalan 
Kanoa, CNMI 
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Chapter 14 


Preliminaries to the investigation of 
clitic sequencing in Greek and 
Indo-Iranian 

Mark Hale 


Concordia University, Montréal 


Beginning with Wackernagel (1892), scholars have dedicated a great deal of attention to 
the question of the placement of enclitic elements within their clause, particularly those 
which (tend to) appear in so-called “Second Position”. Anderson (2005) summarizes the 
honorand’s long-standing interest in and contribution to the critical “interface” issues which 
arise for linguistic theory from these elements. As has been well known, again since at least 
the time of Wackernagel’s writings on the matter, several archaic Indo-European languages 
enjoy particularly rich inventories of relevant elements. Greek, in particular, has a set of 
toneless (“enclitic”) and tonic (“postpositive”) so-called “second position” lexemes. Since 
only one entity can technically occupy “second position” in any given string, an obvious 
empirical issue arises regarding clauses which contains multiple “second position” elements. 
This matter has received significantly less careful attention than has "second position clisis" 
generally in the past 125 years. In this paper I present a detailed analysis of what happens 
when multiple elements seem to have a demand on "second position" in the Attic Greek 
clause (focussing on the Plato and Euripides corpora), and demonstrate that in developing 
an account for why the ordering is the observed one, a richer understanding of the actual 
mechanisms behind our (ultimately epiphenomenal) Wackernagel's Law can be developed. 


1 Introduction 


In Anderson (2005) the honorand of this volume presented a detailed analysis of clitic 
placement, including the positioning of so-called "second position" (henceforth, 2P) cl- 
itics. I have neither the competence nor the space to fully engage with his insightful 
and intriguing proposals in this venue. However, given that his analysis was developed 
against the backdrop of a consideration of empirical data from a broad set of languages, 
we can all recognize that it is necessary for such cross-linguistic approaches to abstract 
away from a certain number of seemingly low-level technical matters which arise in in- 
dividual linguistic systems so as to not impede the development of a general theory. In 
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some cases, one is leaving to one side matters which are both well described and well 
understood in the specialist literature on the language in question. However, in the case 
of what is perhaps the most famous data on 2P clitics - the Greek and Indo-Iranian data 
made use of by Wackernagel (1892) in the grounding document for so-called “Wacker- 
nagel's Law” - this is not the case. Surprisingly, the empirical data for these languages 
is relatively poorly understood, in my view, even in the specialist literature. 

It should be clear that building a general theory of clitic behavior in human linguistic 
systems is only going to be as successful as the quality of the empirical data on individual 
languages used as input to the theory construction process allows. The more poorly such 
data is described, or understood, the potentially weaker the resulting general theory. In 
this paper I focus on one aspect of clitic behavior in systems of the Greek/Indo-Iranian 
type: the sequencing of 2P clitics. I argue that the weakness in our capacity to insight- 
fully account for observed sequencing of 2P clitics highlights the shortcoming of our 
understanding of the phenomenon in these languages generally, and point toward some 
ways we might, in my opinion, approach these issues so as to improve our understand- 
ing. This would allow the Greek and Indo-Iranian data to play their rightful role in the 
evaluation of more broadly-based theories such as that of Anderson (2005). 


2 Wackernagel's so-called Law 


The literature on so-called “second-position” clitics goes back at least to Bartholomae 
(1886). Wackernagel's Law universally recognizes a tendency for certain types of prosod- 
ically deficient element to occupy "second position" in the clause in Ancient Greek and 
Indo-Iranian (at least). A very clear statement of Wackernagel's own version of his “law” 
can be seen at the beginning of his famous Vorlesungen über Syntax (Wackernagel 1920: 
7). 


Z. B. im altesten Griechisch, in sehr hohem Masse bei Homer, auch noch bei Hero- 
dot, ist das Gesetz lebendig, dass schwach betonte Wórtchen, welches immer ihre 
syntaktische Beziehung sei, unmittelbar hinter das erste Wort des Satzes gestellt 
werden.? 


In spite of the 130 years which has passed since Bartholomae first posited the tendency, 
and the heaps of follow-up scholarly literature, work which addresses the specific issue 
we will concern ourselves with today - the sequencing of 2P clitics — in any serious 
way is quite sparse. Early work was impeded by the descriptive goals of traditional Indo- 
European studies: after all, if you only need to catalogue observed sequences of 2P clitics, 
the task simply involves gathering and reporting on the data provided by one's corpus. 


1 I think it is widely recognized that these two systems are very similar to one another, in an Indo-European 
context, both in terms of the richness and diversity of their clitic inventory and in the syntax of these 
elements. 

2 “For example, in the earliest Greek the law is active - to a very large extent in the case of Homer, also 
still in the case of Herodotus - which holds that weakly stressed ‘little words’, whatever their syntactic 
relationship might be, are placed directly after the first word of the clause.” 
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However, when we try not to describe the superficial properties of a string, but to view 
that string as the epiphenomenal output of a computational device (the grammar), the 
system needs to be imbued with powers which will enable it to bring the linear order of 
the output string into existence, rather than just describe that order after the fact (the 
way the researcher does). 


3 The “clitic cluster” 


The various modern Indo-Europeanist approaches to Wackernagel phenomena in Greek 
and/or Indo-Iranian reference different methods for dealing with the sequencing prob- 
lem. One approach, commonly seen in work that takes a “prosody-centric” view of 
things, posits a “second position” clitic “cluster” (or “chain”) into which the clitics are 
placed, and assumes some kind of (usually stipulative) ordering algorithm which ar- 
ranges the clitics within this cluster. This is my understanding of the general approach 
(though, of course, they differ in detail) of Keydana (2011), Lowe (2014) and Goldstein 
(2016). A rough graphical representation of such approaches is presented in Figure 1. 


cl cl cl 


aD 


2P Clitic Cluster 


Ordering Algorithm 


Figure 1: The “Clitic Cluster” 


Typically, the domain within which 2P is defined is taken in these approaches to be 
prosodically defined, and the entities which undergo 2P placement (the clitics) are taken 
to be a prosodic class, but I have never seen an empirically-grounded proposal claiming 
that the observed ordering within the clitic cluster is determined by prosodic consid- 
erations alone. I imagine that this is because it is pretty hard to see any trace of the 
domination of such a factor in the observed clitic sequences in these languages. 
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But in general these authors have not yet ventured a systematic hypothesis about the 
sequencing, so it is hard to determine in any detail the nature of the envisioned system. 
For example, Keydana (2011: 108, fn. 3) says “[a]nother issue not to be addressed in this 
paper is the internal structure of clitic clusters.” Goldstein (2016: 88) says that because 
the matter is a “difficult issue”, he will “leave it for future research, and for the moment 
assume templatic ordering.."? Lowe (2014: 28) writes that “there are regular (though 
not inviolable) orders when more than one element from any one of those categories 
occurs...It is beyond the scope of this paper to account for those patterns.” 

Interestingly, each of the authors does seem to feel that the observed ordering is re- 
lated to “classes” into which the clitics fall. Thirty years ago (Hale 1987: 73), I argued 
that there were three distinct classes of 2P clitics, taking distinct 2Ps for distinct rea- 
sons. Although these are not the labels I would use if I were creating them today, and 
the mechanisms described in 1986 for how the categorization of the clitic relates to its 
placement are woefully antiquated, the classes were: (1) “emphatic” clitics, (2) “sentence 
connective" clitics, and (3) “sentential” (usually pronominal) clitics. The first I took to in- 
volve word-level attachment and the second clause-level. The third were constrained to 
appearing in a very low position in what we would now call the CP domain, or, indeed, 
at the top of the IP domain. 

I wrote then that “[t]he position of these elements relative to one another follows 
naturally from this account of their origins: the regular sequence is emphatic + sentence 
connective + sentential...” It is hard to see this statement as anything more than either 
wishful thinking or blissful ignorance, both of which I possessed in spades back in those 
days. From my crankier contemporary perspective, it is pretty easy to see that no explicit 
characterization of the membership in these classes was provided (though I gave isolated 
examples of each), nor was any mechanism even hinted at for what might trigger any 
specific ordering in cases of multiple instantiations of one of these categories in a single 
clause. In these matters, unfortunately, I have been largely followed by more recent 
work. 

I think it is clear enough, however, what we would all like to see. Overt stipulation is 
in essence an admission of explanatory failure, and all principles of the scientific pursuit 
demand of us that we attempt to minimize the role of stipulation in our models. So, our 
hope must be that the clitics fall into non-arbitrarily-defined classes, and that these clitic 
classes occupy well-motivated positions (relative to the functions the clitics instantiate) 
in the linguistic representation. Since it is safe to assume that no two clitics do exactly the 
same work and co-occur in a single clause, if order falls out in some way from function, 
we should always be able to generate a predicted ordering for any pair of clitics. It is the 
“in some way” that I want to explore today. As a step in that direction, though, I must 
dwell a little longer on previous work. 


3 No explicit template is provided which accounts for the cited data. 
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4 The Hock template 


Around the same time as I was writing the account discussed above, Hans Hock was 
working out the details of an overtly “templatic” approach to clitic sequencing in Vedic 
Sanskrit (e.g., Hock 1996, with earlier literature). The Hock system was not a model of 
internal consistency. In this system, the 2P clitics fall into three classes, P, P, and D: 


(1) P: atonic non-'deictic" (i.e., non-pronominal) clitics 
P: tonic non-“deictic” clitics (sometimes called “postpositives”) 
D: atonic “deictics” (i.e., pronominal clitics) 


Clearly, the classification is based on both prosodic (tonic vs. atonic) and functional con- 
siderations (deictic vs. non-deictic). The template is basically stipulative, but in its most 
common instantiation, shown in (2), it is supposed to be partially motivated: non-deictics 
precede deictics (thus P, P before D, D) and atonic elements from each class precede the 
corresponding tonic elements (thus P before P, D before D). 


(2) X P P D D 


The template is often described as “phonological” or “prosodic” (including by Hock him- 
self), but that, of course, is not accurate. It is important to note that no motivation is 
cited for either of the two ordering principles: no reason for why non-deictics should 
come earlier in the clause than the deictics is given, nor for why the atonic element of 
each category should precede the tonic one.* 

The graphical representation of the template in (2), especially when coupled with the 
claim that the template is “prosodic”, invites the inference that the "goal" of the system is 
have an alternating strong vs. weak tonicity sequence. But since P, P and D can all double, 
and since any of those elements can be absent in any given clause, no such alternating 
pattern is observed in the vast majority of utterances in the actual corpus. The prosodic 
motivation for the template is thus highly abstract (not that that determines whether it 
is accurate or not). 

My actual point in discussing the Hock system, however, is to point out that the mech- 
anisms involved are quite distinct from that which we saw in our discussion of *clitic 
cluster" approaches to clitic sequencing. We can visualize the system as seen in Figure 2 
below. 

There is no "second position" in this conception of things, but a variety of ordered 
positions into which clitics are placed based on their properties (prosodic and functional). 
We can see the differences between the two general models if we imagine a clause with 
only a single 2P clitic in it. In the “clitic cluster" approach that clitic will be “in 2P", plain 
and simple. The ordering algorithm will have no work to do. In an approach such as that 
found in the Hock template, we can (and must) still ask the question: where is the clitic? 


^ In fact, D elements may appear in the X position and, for Hock's version of the Rigvedic template, also 
in the P position, and thus come earlier in the clause than D elements, indicating that there isn't much 
content to this latter principle in any event. 
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cl cl cl 


Figure 2: The “Hock template” 


It could be in P (with P and D empty), or in P (with P and D empty), or in D (with P and 
P empty). 

There is, of course, an inherent advantage, given that we are seeking to develop a 
non-stipulative account of ordering, if we assume that specific clitics map consistently 
to specific positions in the string, and that whether that position ends up being “second” 
or “third” or “fourth” is simply the epiphenomenal by-product of the mapping algorithm. 
A model which, by contrast, dumps the clitics unordered into a 2P cluster and then must 
impose an order on them has, in some sense, missed a chance to impose that ordering ear- 
lier in the initial mapping. Particularly if that mapping is prosody-driven, only a general 
slackening of constraints on the relationship between position (at the point of insertion) 
and interpretation — constraints which seem to hold of the linguistic representation gen- 
erally - can allow a set of unordered elements, placed by a prosodic algorithm, to be 
realigned in an interpretationally-relevant manner (i.e., on the basis of the "functional 
class" of the clitic). 

Hale (1996) represents an attempt to link the "positions" in the Hock template to spe- 
cific structural elements within the clause (a similar orientation is found in other syntax- 
centric but prosody-sensitive approaches). I will not dwell on that reinterpretation of 
Hock here, but instead turn to one more modern model of 2P placement. 


5 Wackernagel's optimality-theoretic approach 


Wackernagel himself pretty clearly conceived of the 2P phenomenon as a kind of a com- 
promise between a clitic element representing the kind of information which should 
come early in the clause (generally because of its "linking" function to the preceding 
context) and that same element being atonic, and thus not particularly suitable for ini- 
tial position. This traditional conception is very closely followed by modern Optimality 
Theoretic approaches to 2P placement, as reflected, for example, in Anderson (2005). 
The basic idea of such approaches, as with Wackernagel's, is that there is a drive for 
2P clitics to be initial (captured by an ALIGNLEFT constraint, which incurs one violation 
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for each step away from the left edge a clitic is) and a prohibition against the clitic ap- 
pearing in initial position (captured by a NonINITIAL constraint). Obviously, both of 
these constraints cannot be satisfied in the same string. As always in OT, the conflicting 
demands are resolved via a ranking of the constraints. As long as NonINITIAL outranks 
ALIGNLEFT, the clitic will appear as close as possible to string-initial position, but not in 
initial position — hence the 2P effect. 

We can see this with a tableau fragment concerning the positioning of Greek gár 
"because, since' relative to two tonic lexical items, which I designate simply as X and Y. 
Greek gár is a standard example of a "Wackernagel's Law” element in the language.” 


X, Y, gar || NonInrriat | ALIGNLEFT(gar) 
X Y gár its 

X gár Y S 

gar X Y a! 
gar YX *! 
Y gár X : 
Y X gár ar 


(3) 


afr} oe} alo | sip 


Wackernagel believed that the drive towards initial position was of different strength 
for different enclitic objects. As long as we make the ALIGNLEFT constraints clitic (or 
clitic class) specific, a strict ranking version of OT will require us to decide for each clitic 
just how strong the demand is that it be aligned left. For example, if we take the Greek 
focus-marking enclitic mén, we are compelled by the model to ask: is it more important 
for mén to be on the left, or for gar to be on the left? (In Wackernagel’s terms, is the 
“drive for initial position” stronger for mén or for gar?) 

The tableau in (4) below shows how such a system generates — again by necessity — a 
sequencing of clitics. We should always be attentive, I think, when our modern models 
seem to converge on the ideas of important scholars like Wackernagel, who could not 
have envisioned the workings of OT, and thus independently had a similar conception 
of a certain class of linguistic phenomena. If Wackernagel and Anderson agree, it would 
be wise to not dissent too quickly! But is OT the happy confirmation of Wackernagel's 
less formal and more intuitive analysis? 

The model in some sense combines properties of the two earlier approaches I sketched: 
there is a sense of a "fixed stipulative order" (as seen in the Hock Template) in the rela- 
tive rankings of the ALIGNLEFT(x) constraints, but there is no sense of fixed stipulative 
positions in the resulting representation (as that model implies), thus making it more like 
the “clitic cluster" analysis. It has the advantage, over the “clitic cluster” analysis, of not 
adding a new object to our model of the grammar (the “clitic cluster"), nor requiring a 
distinct computational process which is responsible for explicitly ordering the elements 
within the cluster (the “ordering algorithm” mentioned above). 


5 The reader will notice that this tableau fragment does not select between two “winning” outputs: X gar Y 
and Y gár X. Obviously other constraints will determine the ultimate optimal output form with respect to 
the relative ordering of the tonic elements to one another. 
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(4) EE NoNINITIAL | ALIGNLEFT | ALIGNLEFT 
Acus (ct) (mén) (gar) 

a. X Y gar mén il HE 
b. X Y mén gár 7 X 
c. X mén Y gár $ 2M 
d. mén X Y gár ji E 
e. X gár Y mén SE B 
f. X gár mén Y Gel » 
g. © XméngárY i " 
h. mén X gar Y = Le 
gar X Y mén S GC 
j. gár X mén Ý E! xd 
k. gár mén X Y S a 
L mén gar X Y S E 
m. gár Y X mén =] SC 
n. gar Y mén X = s 
o gar mén Y X il d 
p. mén gár Y X "f i 
q. Y gár X mén Sa x 
r. Y gár mén X el i 
s. © Ymén gar X l u 
t. mén Ý gár X Si Ms 
u. Y X gár mén SIT E 
v. Y X mén gár ul as 
w. Y mén X gár S Cl 
x. mén Y X gár A Ges 


I have both conceptual and empirical concerns about the model. At the conceptual 
level, stipulation is built into the model pretty deeply: the so-called “factorial typol- 
ogy” argument says that we could just as easily have ALIGNLEFT(gar) outrank ALIGN- 
LEFT(mén), and indeed that every ordering of every available ALIGNLEFT(x) constraint 
should be, in principle, observed. This does not seem to me to be very consistent with 
the data I have seen from archaic Indo-European languages (which often involves ety- 
mologically unconnected, but functionally similar enclitics showing the same ordering 
principles). 

Empirically, one of the great challenges to this model holds, in my view, for all of 
the other models we have treated to this point as well. The domain over which the 
ALIGNLEFT(x) constraint must be assessed is, in the model, a pure stipulation (i.e., there 
are no principles regulating what a given enclitic might be "aligned" to). The same prob- 
lem, in my view, plagues the so-called “prosody-centric” approaches which have become 
popular: the clitic is said to move into the “clitic cluster" in second position of some do- 
main, but none of the approaches I have seen (Keydana 2011; Lowe 2014; Goldstein 2016) 
present a non-stipulative characterization of how the appropriate domain is established. 
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A clitic is said to take 2P within (some) Intonational Phrase, but the question of which 
one (when there are multiple IPs in the string) is left unclear. It also seems like the “In- 
tonational Phrase" portion of the specification (and, in non-OT approaches, the 2P part 
as well) is purely stipulative: could one place a clause-conjoining clitic in 2P of (some) 
Utterance Group or (some) Phonological Word? How about in 3P in the Intonational 
Phrase? The models proposed are so inexplicit it is difficult to determine precisely what 
their allowable elements and operations are - the models are thus both highly stipulative 
and poorly constrained. 

As mentioned above, my concerns about the Hock Template have been spelt out in 
considerable detail in Hale (1996), where I attempt to reduce the empirically valid aspects 
of the template to the interaction of (1) normal syntactic placement of the relevant enti- 
ties and (2) Halpern's (1995) “prosodic inversion". I won't go through these details here, 
except to note that in this conception of things enclitic elements can be in a variety of 
structural positions (in the syntax) and may (or may not) undergo “prosodic movement" 
in the phonology (depending on whether they are “properly hosted" without such move- 
ment). 

It is obvious that a system which leverages both syntax and phonology to account for 
observed clitic placement is in some sense more complex than either a purely syntactic 
or a purely phonological one. But the evidence that the process is not purely syntactic is, 
as far as I can tell, universally accepted. The 2P clitics often interrupt manifest syntactic 
constituents (in a manner the syntax does not allow) and are quite regularly placed in 
positions in the string from which appropriate scope relations could not possibly be 
established. "Prosodic" rearrangement allows the syntax to be mundane rather than 
strikingly bizarre (with all of the implications such bizarreness would have, if allowed, 
for our theories of grammar). 

That the syntax is involved is, in my view, absolutely required if we are to solve the 
"domain" problem. Sanskrit ca and Attic te are 2P clitics, but they appear in clause-second 
position only when their domain is the clause. They appear in “DP-second” position 
when their domain is the DP, and in VP-second position when their domain is the VP. 
Their domain simply cannot be defined with respect to prosody alone. 


6 Sample application: Enclitic subordination 


Both of these observations can be seen to be at work in a set of examples involving 
enclitic subordinators. Before examining this data, which will also display the mecha- 
nisms I believe are at work in clitic placement (and thus what we might use to explain 
clitic sequencing), I will remind the reader that Hale (1987) showed that there was a 
process manifest in the language of the Rigveda, in Avestan, and in Greek, whereby a 
single constituent could be fronted to the left of a WH-element. Obviously, this analysis 
extends (though the matter was not overtly discussed - but rather implicitly assumed - 
in that earlier work) to other subordinators present in C (‘if’, ‘when’, "because, I will 
call that fronting process "topicalization" (and the position into which the fronting takes 
place Top), with no particular commitment to the discourse functions involved. 
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Familiar examples of this fronting looks like this: 


(5) a. [á$mànam cid] yé bibhidür vácobhih 
rock-Asg Emph.cl Rel-NPIM smashed-IIIPl words-ISgN 
‘who smashed even rock with (mere) words, ` RV 4.16.6c 
b. [idhmám] yás te jabhárac 
kindling-ASg Rel-NSg you-DSg.cl would bear-IIISg 
chasramanah 
exerting himself-NSg 
‘who, exerting himself, would bear the kindling to you... RV 4.12.2a 


Armed with the following assumptions, then, let us see what some structures look like, 
and how they might impact the development of a theory of clitic sequencing: 


e enclitic elements are placed in “expected” syntactic position 


e “prosodic inversion" is triggered when they are not properly hosted on their left 


edge 


Wackernagel proposed long ago that there were traces in the Rigveda of a reflex of IE 
"k"e which (like OLat. absque me esset ‘if it were without me’, some uses of Gothic nih, 
and, although not known to Wackernagel, Hittite takku) is subordinating in function, 
generally rendered ‘if, when’ As might be expected, the verb in such clauses, as in 
subordinate clauses in Vedic generally, is accented. This has given rise to some anxiety 
that the true subordination marker is the verbal accent, and that ca is simply (weakly) 
coordinating. Typical examples are: 


(6) a. niuptas cababhrávo vácam ákratam 
scattered-down ca brown-NPI voice-ASg they made 
émid esàm niskrtám jariniva 
I go=PTCL their; appointed place-ASg paramour-NSg-like 
‘And as soon as, scattered down, the brown (dice) have raised their voice, I 
just go to their appointed place, like a girl with a lover’ (SJ/JB) 10.34.5cd 
b. tuvam ca soma no vá$o 
you-NSg ca Soma-VSg us,; you should wish 
jivatum ná maramahe 
to live NEG we will die 


‘And if you will wish us to live, Soma, we will not die’ (SJ/JB) RV 1.91.6ab 


I have provided the translation of Jamison & Brereton (2014) because it reflects directly 
the unease that some Vedicists feel about “subordinating” ca: they have translated ‘and 
as soon as (=when)’ and ‘and if’, leaving it unclear whether they believe that ca is coordi- 
nating (‘and’) and the verbal accent subordinating (‘as soon as’, ‘if’), or whether perhaps 
they believe that this ca actually means ‘and as soon as/if’. 
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As one can imagine, determining whether or not such clauses are weakly connected 
to the preceding discourse - i.e., whether the ‘and’ should be in the English translation 
- is no easy task. And from this example, and other widely accepted ones (such as RV 
8.21.6ab below), in which ca occupies second position, it doesn’t seem likely that our 
approach to clitics is going to be much help in this task. 


(7) áchà ca tvainá namasa vadamasi 
PV ca you,;=this-ISg homage-ISg we address 
kim muhus cid vi didhayah 
Q-marker for a moment even,; PV you will think 
"When we address you with this homage, will you hesitate even for a moment?’ 
RV 8.21.6ab 


Hettrich (1988: 252) notes overtly on subordinating ca that ca stands “wie nach Wacker- 
nagels Enklisengesetz[fn deleted] zu erwarten, überwiegend an zweiter Stelle im Satz.” 

It is somewhat striking to see the überwiegend in this statement, because coordinating 
ca is extremely regular in its “second position" behavior, being postponed only in cases 
in which there is a “phonological word" at the start of the conjoined domain. Are there 
actually cases of "late ca" in subordinating function? The following examples seem to 
answer this question "yes". 


(8) a. asyá$slóko divíyate prthivyám 
his call-NSg heaven-LSg-speeds earth-LSg 
átyo na yamsad ` yaksabhíd 
steed-NSg like will control bringing-wondrous-apparitions-NSg 
vícetah 
discriminating-NSg 


mrgánam ná hetáyo yánti cemá bíhaspáter 
wild beasts-GPl like charges-NPI they go ca-these-NPl Brhaspati-GSg 
ahimayarh abhi dyin 


having-snake-wiles-APl to ^ heavens-API 

"Ihe discriminating one [-Brhaspati?], like a steed, bringing wondrous 

apparitions, will control it when these (words) of Brhaspati, like the charges 

of wild beasts, go to the snake-wiles-possessing heavens: RV 1.190.4 
b. ubháyam Srnavac cana 

twofold-ASg will hear ca us; 

indro arvag idám  vacah 

Indra-NSg nearby this-ASg speech-ASg 

satráciyà maghava sómapitaye 

fully-focussed-ISg benefactor-NSg soma-drinking-DSg 


D * as would be expected according to Wackernagel's Law, overwhelmingly in the second position in the 


clause? 
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dhiya $ávistha a gamat 

thinking-ISg most-powerful-NSg PV will come 

"When Indra nearby will hear this twofold speech of ours, the most powerful 
benefactor will come here to the soma-drinking by reason of our fully 
focussed insight: RV 8.61.1 


To understand this data (and there are one or two more examples), we need to ask the fol- 
lowing question: how does conjunctive ca end up in "second position" when it conjoins 
clauses, and how would we expect a “subordinating ca” to behave given the assumptions 
outlined above? 

The behavior of coordinating ca is fairly straightforward. No matter what kind of 
syntactic (or prosodic!) entity ca is coordinating, it appears in this configuration: 


(9) XP 


ca XP 


pes 


XYZ 


[X Y Z] can be a clause, as in the case under discussion, or an DP or VP or PP or whatever. 
Obviously, ca sits at the left edge of the XP-domain and thus does not have a proper 
prosodic host to its left. It therefore must undergo the “prosodic flip”, and, assuming X 
is a “phonological word", the resulting operation will give rise to this string: 


Lm] 
(10 caXcaYZ 


It should be clear that one thing that ca can bring into a coordination relationship with 
what precedes it is a clause introduced by a topicalized phrase. Here's an example. 


(2) yád agna esá sámitir bhavati 

when Agni-VSg this-NSg assembly-NSg will become 
devi devésu yajata yajatra 
godly-NSg gods-LPl sacrifical-NSg sacrificial one 
ratna ca  yád vibhajasi 
treasures-AP] and; when you will share out 

svadhavo 

having-independent-will-VSg 
bhagam no átra vásumantam vitat 
share-ASg us, then rich-in-goods-ASg pursue 
"When, o Agni, this assembly will become godly among the gods, a sacrificial 
one, o sacrificial one, and when you will share out treasures, o you of 
independent will, then pursue a share for us rich in goods: (SJ/JB) 

RV 10.11.8 
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The c-pada of this verse arises from a syntactic structure of the form: 


(12) TopP 
Be ae 
ca TopP 
"cm 
ratna CP 
Or SS, 
yad IP 


vibhájasi svadhavo 
So where do we expect subordinating ca to be syntactically? Well, it is isofunctional 
with yád in the clause above. So the structure of the subordinate clause in (6a) at the end 


of syntactic computation would presumably be: 


(13) CP 


ca IP 


[nfuptas babhrávas] vácam ákrata 


Since ca is alone up there in the left periphery, it must undergo the “prosodic flip" in the 
phonology to be properly hosted. 

What would happen if we were to have a Top element in such a clause? In the case 
of coordination, it of course is the entire clause, including its initial Topic-phrase, which 
gets coordinated to a preceding (or following) clause, and ca thus dominates TopP. But 
topicalized material appears to the left of the subordinator (relative pronouns, yádi ‘if’, 
etc.), so if ca is a subordinator, we predict a structure such as (contrast coordinating ca 
in (12)): 


(14) TopP 


Pam 


Topic CP 
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If the proper “hosting domain” for ca is the CP, then it is unhosted on its left, and we 
predict the phonology to restructure this to: 


| 
(15) Topic [ca X ca Y Z] 


Note that, if there are such examples, since coordinating ca won't act this way, but sub- 
ordinating ca should, we would have clear and unambiguous evidence for the syntactic 
separation of ca into two distinct types of grammatical element. And of course there are 
such examples. We saw them above in (8ab), whose structures are: 


(16) a. [top mrgánam ná hetáyo] [cp ca [yánti ca ima brhaspater áhimayar abhí dyün]] 
b. [rop ubháyam] [cp ca [$rnávac ca na indro arvág idám vacah]] 


The construction as a whole is rare, but confirmation for this analysis is provided by 
the reflex of subordinating ca in later Vedic texts (and, rarely, already in the Rigveda) - 
the subordinating particle céd (etymologically from ca + id). The “normal” position for 
this particle is, of course, in second position. 


(17 a ná vá  aranyànír hanti 
Nec PTCL Lady of the Wilderness-NSg slays 
anyás cén nábhigáchati 


another-NSg céd Nzc-attacks 

'In truth, the Lady of the Wilderness does no slaughter, if someone else does 

not attack. (SJ/JB) RV 10.146.5ab 
b. yó asya üdho ná veda- 

who-NSg her-GSg,; udder-ASg Nec knows 

-atho asyà stánàn uta 

thereto=PTCL her-GSg,; teats-API as well 

ubháyenaivásmai duhe 

both-ISg-PTCL-him-DSg, she yields milk 

datum céd á$akad vasam 

to give céd he was able cow-ASg 


“Whoever knows not the udder of her, and likewise the teats of her, to him 
she yields milk with both, if he has been able to give the cow. (Whitney) AVS 
12.4.18 


But, as with subordinating ca, we find unexpectedly “late” instances of céd as well. 


(18) a. [arthino] yanti céd artham 
having-a-task-NPl proceed céd task-ASg 
gáchàn íd ^ dadáso  ràtím 


they will go to PTCL giver-GSg generosity-ASg 


‘If those having a task proceed to their task, they will attain the generosity of 
the giver: RV 8.79.5ab 
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b. [abandhv éke dádatah ^ prayáchanto] 
kinless-NPl some-NPI giving-NPl bestowing-NPI 
dátum céc chíksànt sá svargá evá 


to give céd they are able this-NSg heaven-NSg indeed 
‘If some, without kin, giving, bestowing, are able to give, this is truly heaven’ 
AVS 6.122.2cd 
c. hédam pa$ünám ny èti 
wrath-ASg cattle-GPl PV comes 
bráhmanébhyó 'dadad vasam 
Brahmans-DPl not-giving-NSg cow-ASg 
[devánam níhitam bhàgám] 
gods-GPl deposited-ASg portion-ASg 
mártya$ cén nipriyayate 
mortal-NSg céd keeps (for himself) 
"Ihe mortal not giving a cow to the Brahmans goes down to the wrath of the 
cattle, if he keeps to himself the deposited portion of the gods’ 
AVS 12.4.21 


And note that we can use our analysis of these “late” instances to make our interpreta- 
tions of certain Vedic passages more precise. Look at the AB passage (from the Sunahsepa 


legend) in (19). 


(19) 


mam asmin ` samnayaty 

debt-ASg him-LSg,; he pays 

amrtatvam ca  gachati 

immortality-ASg and, he goes to 

pità putrasya jatasya 

father-NSg son-GSg born-GSg 

pasyec cej jivato mukham 

he should see ced living-GSg face-ASg 

‘A debt he payeth in him, and immortality he attaineth, that father who seeth the 
face of a son born living’ (Keith) AB 7.13.4 


Keith, whose translation I have provided, takes [putrasya jatasya... jivato mukham] ‘(the) 
face of a son born living’ as a (discontinuous) constituent, the direct object of the verb 
pasyet. That is, his analysis (ignoring for a moment the ced, to which we will turn mo- 
mentarily) is that the subject and predicate divide like this: 


(20) 


[pita] [putrasya jatasya pasyet jivato mukham] 


There are two possibilities for where pita could be under Keith's interpretation: it could 
have been fronted into the Topic position, or, of course, it could be in some position 
lower than C (in Focus, or in IP, e.g.). If it were below C, the output of the syntax (now 
with ced reintroduced) would have been as below, with the “prosodic flip” indicated: 
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(21) [cp ced [pita ced putrasya jatasya pasyet jivatah mukham]] 


If pita were in Topic, we would have instead expected: 


(22) [Topic pita] [cp ced [putrasya ced jatasya pasyet jivatah mukham]] 


Neither of these is the sentence in the text. It is clear what the structure must be if the 
placement of ced is to fit with all the other evidence for the use of this particle in early 
Vedic: 


(23) [topic pita putrasya jatasya] [cp ced [pasyet ced jivatah mukham]] 
‘when the father of a (just) born son; sees the face of (him;) living’ 


7 hi, gar, and clitic sequencing 


In my dissertation (Hale 1987), I dealt fairly extensively with the data from Vedic hi 
‘because, since’. I noted that while the vast majority of instances of hi are in “second 
position” (appropriately defined), there were a number of counterexamples. Note that 
hi occupies, at the end of the syntactic computation, the very same position ("C") as 
subordinating ca and céd. 

I won't bother citing second position instances of hí — as I said, the vast majority of 
the approx. 630 attestations of the particle are in that slot, properly defined. Some of 
the not terribly numerous exceptions are given in (24) below. Several interesting issues 
arise, so I cite a healthy number of the exceptions. 


(24) a. urukramásya sá híbándhur itthá 
wide-striding-GSg this-NSg hí bond-NSg thus 
‘for exactly that is the bond to the wide-striding one’ (SJ/JB) 


RV 1.154.5c 
b. asmáf ca  támé ca pra hi nési vásya á 
us-API and; them-API and; PV hi lead better-ASg PostP 
‘lead both us and them forth to a better state. (SJ/JB) RV 2.1.16c 


c. tribhíh pavitrair ` ápupod dhí arkám 
three-IPl purifiers-IPl he purified hi chant-ASg 
‘Since he [-Agni?] purified the chant with three purifying filters; (SJ/JB) 
RV 3.26.8a 
d. áksetravit ksetravídam hi aprat 
not-knowing-the-field-NSg knowing-the-field-ASg hi asked 
‘Because the one not knowing the field asked the field-knower; (SJ/JB) 
RV 10.32.7a 
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In all of the “exceptions” I will cite here, we can analyze the data just as we did in the 
case of ca and céd: the first constituent of the clause is in the Topic position above the 
CP, hi is in C itself, and is not “properly hosted” by a tonic element within its domain 
on its left, and thus undergoes inversion. Thus we have [Top urukramásya] in (24a), [Top 
asmáii ca tams ca] in (24b), [Top tribhih pavítrair] in (24c), and [rop áksetravit] in (24d). 

In my dissertation, I rather unwisely said, regarding examples such as these, that the 
poets were able to treat the caesura as equivalent to a clause-boundary, and thus place hí 
in second position after the caesura, rather than after the actual start of the clause. This 
is a not a particularly good idea, giving the meter far too much power to determine the 
data — certainly far more than I would be willing to concede at this stage of my research 
on the matter. 

We can give a much more sensible assessment of this data if we instead note that the 
boundary between the element in Topic and the start of the CP-domain is marked by an 
intonational reset (or pause), and that the natural place to align this pause within the 
rhythmic structure of the verse line is at the caesura. In all of the examples above, the 
Topic ends at the caesura of a trimeter line (this will be true of the examples I cite below 
as well). 

As usual, there are many other interesting things going on with these examples as 
well. For example, in support of the topicalization analysis, we see in an example such as 
(24a) a discontinuity (urukramásya...bándhur). We need to account for this discontinuity, 
and movement is the way to do it in our model - topicalization provides the relevant 
explanation for that movement. We will see additional examples of this type below. 

Finally, and returning to the matter of clitic sequencing, we may be able to learn some- 
thing important about how exactly the "prosodic flip" works to trigger specific orderings 
from examples such as those in (25). 


(25) a. índro vidvarh anu hi tva cacaksa 
Indra-NSg knowing-NSg PV hi you-ASg, kept an eye on 
ténahám agne anusista agam 
this-ISg=I-NSg Agni-VSg instructed-NSg have come hither 
‘Because the knowing Indra has kept you in his sights, instructed by him 
have I come here, o Agni’ (SJ/JB) RV 5.2.8cd 
b. sadyó jajfiané ví him iddhó ákhyat 
at once being-born-NSg PV hí-them,; kindled-NSg he observed 
‘for immediately upon being born, he, kindled, observed them’ 
RV 10.45.5c 


Recall that pronominal clitics occupy the lowest position in the C-domain (or the highest 
in IP), so one possible structure for what the syntax would have sent to the prosody for 
(25a) would be: 


(26) [rop indro vidvárh] [cp hi [tva [ánu cacáksa]]] 
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In this structure, neither hi nor tvd can be properly hosted on their left, with the expected 
"prosodic inversion" being thus triggered: 


CN 


(27) [Top indro vidvarh] [cp hi [tva [anu hi tva cacáksa]]] 


However, as we all also know, there are many “exceptions” to the syntactic “weak 
pronoun fronting” that seems to be responsible for making pronominal clitics targets 
for Wackernagel's Law-type effects in archaic IE languages. If the tva of (25a) were to 
represent one of these exceptions, and thus be unfronted, the most likely input structure 
for the prosody would have been: 


(28) [rop índro vidvárh] [cp hi [anu tva cacáksa]] 


Which would have been operated on by the prosody so as to create: 


C xN 


(29) [rop indro vidvarh] [cp hi [ánu hi tvā cacáksa]] 


These two possible analyses have quite different implications for how the system I 
have assumed gives rise to clitic sequencing. In the analysis in (26) we would be looking 
at the effects of iterative prosodic inversion events, and the examples would reveal an (as 
far as I can see somewhat unexpectedly) “outside-in” processing (hi flips in first, then 
tva). 

Under the analysis in (28), we are looking at the relationship between the resolution 
of the hosting needs of an unmoved tvà relative to the ability of an inverting hi to “slip 
in” between tvà and tva’s potential (and ultimate) host ánu. The details of the processes 
involved under the latter set of assumptions are too complex for me to deal with in this 
context, but there is evidence that that approach does represent the correct analysis of 
examples such as (25ab). 

Recall our earlier discussion of the Attic Greek mén gar clitic sequence. gar is of 
course essentially isofunctional with Rigvedic hi. In addition, Thomson’s (1939) well- 
known paper on the “postponement of interrogatives" in Attic drama supports the idea 
that one could still front into a high Top position in this language. This has a specific 
entailment, since the WH-elements Thomson talks about are in CP, and the topics he 
deals with are higher, and since gar is in C, there should be Attic drama cases exactly 
like the “postponed” subordinating ca, céd, and hi examples we walked through earlier. 
And there are.’ 


(30) a. pros taüta mé psaüséi tis Argeion 
in light of these-API Nec should touch any-NSg,; Greek-GPl 
emoü- 
me-GSg 


7 Note that in (30c) the articular infinitive construction has a proclitic article, and the first prosodic word 
after which gár ‘flips’ is thus t6i=ploutein. 
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[sp'agei ^ parékso gàr dérén eukardíos. 
knife-DSg I submit to gár neck-ASg bravely 
‘In light of these things, no one of the Greeks need touch me, because I will 
bravely submit my neck to the knife: Eur. IphA 1559-1560 
b. hón g  otite métron out’ arit"mds esti moi: 
which-GPl PTCL NEG measure-NSg NEG number is me-DSge 
[kakói kakón gàr eis hámillan érk'etai. 
trouble-DSg trouble-NSg gár to competition-ASg comes 


*Of which (woes) there is neither measure nor number for me, because woe 


comes into competition with woe: Eur. Tro 620-621 
c. kai né di’ eí tí g  ésti lampron kai 
and by Zeus-ASg if something-NSg PTCL is splendid-NSg and 
kalón 
beautiful-NSg 
è k'aríen ant'rópoisi, dia sé gígnetai- 


or elegant-NSg men-DPl through you-ASg it comes about 

[hápanta] tói-ploutein gár est" hupékoa. 

everything-NPl Art-DSg-being rich-INF gar is — subservient-NP] 
'and, by Zeus, if something is splendid and beautiful, or elegant for men, it 
comes about through you (=Wealth), because everything is subservient to 
being rich: Ar. Plutus 144-146 


In cases involving both mén, which marks focus, and gar, meaning ‘because’, the in- 
terpretation of scope within the clauses indicates that we are dealing with a structure 
such as “BECAUSE (gar) one the one hand (mén) ... on the other hand (dé) ..? When there is 
nothing for the gar to lean leftwards on, we get the surface order mén gar. This indicates 
sequential “prosodic inversion” of the form: 


(31) [cp gár [mén [X mén gar Y Z]]] 


But this is an “inside-out” (mén first, then gár) resolution of the hosting needs of these 
elements. If the Vedic mechanisms are the same — and all indications are that they are 
- then this is clear evidence against the analysis in (26), favoring the (28) analysis. The 
implications of this prosodic “tucking in” have not been explored in any significant detail. 


8 Conclusions 


If we tie the domain of a clitic like hi or gar to its semantic scope - which we can easily 
do by positioning it via the syntax, whose job, after all, is to create precisely these kinds 


8 The particle mén normally has a contrasting element, marked by the particle dé. I translate the contrastive 
relationship between these two elements as ‘on the one hand X, on the other hand Y' below. 
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of scope relations — we need not worry about finding the structure for it to be in “second 
position” in. If the clitic cannot be hosted on its left in situ, that structure will be the 
one which provides the nearest prosodic host to the right of the syntactic position of the 
clitic, regardless of what entity that is. 

As with other syntactic entities that take arguments, we sometimes do have to specify 
the nature of those arguments. But it doesn’t follow from that that we need to do it 
stipulatively — it isn’t chance that there is no word-level hi or gar ‘because’ clitic. A word 
doesn't express the kind of things ‘because’ needs to take as an argument to generate a 
coherent semantics.’ But that same word would work fine as an argument of ‘and’, 

Given a sufficiently rich understanding of the semantics of a particular enclitic or post- 
positive, we should be able to deduce the nature of the kinds of syntactic entities it can 
take as an argument. No stipulation should be needed. Of course we are far from having 
this kind of understanding of the meaning of many Vedic and Attic Greek enclitics. 

To the extent we can determine with some degree of confidence the syntactic position 
of the enclitic elements we are interested in, we are in an excellent position to examine 
their surface position (which may be the same as their syntactic position, but may be 
perturbed by the “prosodic inversion” process) with a view to determining the detailed 
mechanics of the interactions involved when multiple 2P elements are present in a string. 
The more explicit a conception we have of the relevant algorithms, the easier this task 
will be. One of the strengths, in my view, of the model assumed here is that its parts are 
all clearly enough defined that it should be easy to discover those instances, if any, in 
which stipulation may be, unfortunately, required. 

By contrast, approaches which leave vague the processes that give rise to clitic se- 
quencing are revealing in that shortcoming their general inadequacy. Getting prosodic 
positioning to interact in the required way with, on the one hand, syntactic positioning 
(which all grammatical theories require) and, on the other, with semantic interpretation 
(ditto), is a very non-trivial problem: it goes to the core architecture of the grammar. 
Working out the details of one’s assumptions in this domain cannot be left as an exer- 
cise to future work - one needs to be formulate a clear notion about such things going 
in. When there are multiple clitics we see overtly the failure of inexplicit models (such 
as Hale 1987, and a lot of subsequent work), but those same problems are present, if 
obscured, in the case of simple clitics as well. 


? Yes, I know about the prepositional ‘because’ phenomenon. If you think about what such strings mean, 
and assume that their meaning is representationally present (but not all pronounced), as in 


Q: Who slew Vrtra? 
A: Indra. 


in which ‘Indra’ means ‘Indra slew Vrtra’ (because it can be a lie, and only propositions can be false, not 
nouns), then you'll see why I don't think this is a problem. Anyway, there's no evidence that the speakers 
of Rigvedic Sanskrit could say: ‘Indra slew Vrtra. Because, the waters: 
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Nominal verbs and transitive nouns: 
Vindicating lexicalism 


Paul Kiparsky 


Stanford University 


Event nominalizations and agent nominalizations provide evidence that all affixation is mor- 
phological, and that phrasal categories are projected from words in the syntax. Departing 
from both transformational and earlier lexicalist approaches to nominalizations, I first ar- 
gue on the basis of English and Finnish evidence that gerunds are not DPs built on heads 
that embed an extended verbal projection (Baker & Vinokurova 2009; Kornfilt & Whitman 
2011), but IPs that need Case. They are categorially verbal at all levels of the syntax, includ- 
ing having structural subjects rather than possessor specifiers. Their nominal behavior is 
entirely due to the unvalued Case feature borne by their Infl head, which they share with all 
participial verb forms. I then argue that agent nominalizations are categorially nominal at 
all levels of the syntax, and that the verb-like case assignment of transitive agent nominal- 
izations is due to the verbal Aspect feature borne by their nominalizing head. Vedic Sanskrit, 
Northern Paiute, and Sakha evidence is shown to favor this analysis over B&V’s analysis 
of intransitive agent nominalizations as nominal equivalents of Voice heads and transitive 
agent nominalizations as Aspect heads. The two “mixed” categories — gerunds and transitive 
nominalizations - thus prove to be formally duals: respectively verbs with Case and nouns 
with Aspect. 


1 Nominalizations 


The earliest generative work derived all nominalizations syntactically (Chomsky 1955; 
Lees 1960). Chomsky (1970) then argued that only -ing gerunds are derived syntactically, 
while all other types of event nominals, such as refutation, acceptance, refusal, are de- 
rived morphologically in the lexicon from bases that are unspecified between nouns and 
verbs. The suffix -ing was shown to serve both as the gerund formative and as one of 
the formatives that derive lexical event nominals. Chomsky's main argument was based 
on the fact that gerund phrases have the structure of verb phrases whereas other event 
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nominals have the structure of noun phrases.! The differences are completely systematic. 
Unlike derived event nominals, gerunds are modifiable by adverbs, assign structural case 
to their complements (Table 1a,b), disallow articles and other determiners (Table 1c) and 
plurals (Table 1d), allow aspect (Table 1e) and negation (Table 1f), and they have a gram- 
matical subject which is assigned a Th-role as in finite clauses (Table 1g), and which may 
be an expletive (Table 1h). In (Table 1g), reading refers to an event and her to its agent, 
the reader. In the derived nominal, her could also be a sponsor, organizer, or some other 
participant of a reading event not necessarily identical with the reader (Kratzer 1996; 
2004), and reading could also mean ‘manner of reading’, ‘interpretation’. Without an 
of complement, derived nominals are also interpretable in a passive sense: in Mary's 
confirmation, Mary could be the confirmer or the confirmee. 


Table 1: Gerunds vs. Nominals. 


Gerunds Nominals 
(-ing") (-ingN, -ion, -al, -ance...) 
a. Adjectives “her quick signing the her quick signing of the 
document document 
b. Adverbs her immediately reciting it “her immediately recital of it 
c. Determiners *the/a/this/each performing it ^ the/a/this/each performance 
of it 
d. Plural *her readings it / her readings of it 
“her reading its 
e. Aspect by her having sung it “by her having sung of it 
f. Negation by her not approving it “by her not approval of it 
Subject we remembered her reading it we remembered her reading 
of it 
h. Expletives it(s) seeming to me that I *it(s) appearance to me that I 


exist? exist 


The lexicalist line of analysis continues to be developed in different ways (Malouf 
2000; Blevins 2003; Kim 2016). But many recent treatments have reverted to a uniformly 
syntactic derivation of nominalizations, in which nominalizing heads project a nominal 
structure and have a verbal complement whose type determines the nominalization's 
properties. The differences between the two types in Table 1 is captured by introducing 


! Chomsky also contrasted the uniformity, regularity and full productivity of gerunds with the morphologi- 
cal and semantic diversity, idiosyncrasies, and limited productivity of derived event nominals. As Anderson 
2016 notes, these points played a subsidiary role in Chomsky's argument. Indeed, they are not compelling 
criteria by themselves, for there is no shortage of productivity and regularity in the lexicon, and syntax 
has its share of idiosyncrasy. 

2 T can't help but feel a little despondant due to it seeming to me that the TIE/fo and the T-70 make the original 
TIE and T-65 somewhat redundant (Internet), evidence that "explains away" its seeming to me that p is the 
case (James Pryor, The Skeptic and the Dogmatist, Noûs 34: 534, 2000). The variation between Poss-ing and 
Acc-ing gerunds seen here is briefly addressed in 2.3 below. 
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them at different levels in the functional structure. The gerund -ing" is structurally high, 
and derived nominals in -ingN, -ion, -al, -ance are structurally low. 

Kornfilt & Whitman (2011) dub this the FUNCTIONAL NOMINALIZATION THEsIs (FNT), 
and propose a typology of four levels of nominalization, CP, TP, vP and VP. In this 
typology, English gerunds are TP nominalizations, while derived nominals are VP nomi- 
nalizations.? This paper vindicates a uniform treatment of nominalizations in a different 
way: all true nominalizations are derived lexically; gerunds are not nominalizations at 
all - they are neither DPs nor NPs but IPs that need Case. 

As my point of departure I take Baker & Vinokurova's (2009) theory, which extends 
the FNT from event nominalizations to agent nominalizations. For gerunds, B&V posit 
the structure (1a), based on the version of the DP analysis originated (along with the DP 
itself) by Abney 1987. The DP's complement here is an NP headed by the gerund nominal- 
izer -ing", below which the structure is entirely verbal: an AspP which hosts aspectual 
material and certain adverbs, and which has a vP complement whose v (v=Voice) head 
assigns structural case and introduces an external agent argument. This external agent 
argument shows up as a genitive in D head. B&V do not say exactly how it gets there 
in (1a); perhaps it is base-generated in D and bears a control relation to the PRO in the 
Spec-vP position where the agent role is assigned. 

For derived nominals B&V propose the structure (1b), where the head (-ing’, -ion, etc.) 
takes a bare VP complement. Because it has no Asp or v projection, it contains neither 
adverbs, agents, nor structural case.* 

The structures in (1) take care of the contrasting properties Table 1b, e, f, and g, but 
leave the remaining four properties (Table 1a, c, d, and h) to fend for themselves. On the 
one hand, the DP in (1a) provides too little structure: expletive it-subjects are believed to 
occupy Spec-IP or Spec-TP, but (1)a provides no Spec-IP or Spec-TP for them? On the 
other hand, the DP, needed in the analysis as a site for the gerund's subject, generates 
unwanted structure. Since DPs can have plural heads, adjective modifiers, determiners, 
and quantifiers, the DP analysis wrongly predicts that Table 1a, c, and d should be gram- 
matical. To maintain it one must somehow prevent functional projections like AP, OP, 
and NumP from appearing in DPs that have NP complements that have AspP comple- 
ments, while allowing them in other kinds of DPs, and one must prevent the head of a 


3 A syntactic derivation of gerunds from TP/IP is also developed by Pires 2006. The aspectual content of the 
gerund is treated in Pustejovsky 1995; Alexiadou 2001, and Alexiadou, Soare & Iordáchioaia 2010. Alexiadou 
and her co-workers conclude that gerunds are imperfective Aspect heads that dominate VoiceP and vP, 
while nominalizers are n heads that also dominate VoiceP and vP, but under NumberP and ClassifierP, 
housing adjective modifiers, determiners, and plural. 

4 All analyses have to contend with the fact that certain adverbs can occur as postmodifiers with derived 
nominals, and even with some underived ones (Payne, Pullum & Huddleston 2010); they cite examples such 
as the opinion generally of the doctors, a timber shortage nationally, the people locally, and the intervention 
again of Moscow. We shall see similar Finnish data in $3 below. 

5 Expletive there, which likewise appears in gerunds, may sit in a lower subject position, since it is sensitive 
to the argument structure of the predicate — the absence of Cause according to Deal 2009, who puts it 
in the specifier of v. Like expletive it, there does not appear in derived nominals ("there's appearance to 
be a problem). On Deal’s analysis, the distribution of expletive there is consistent with my IP analysis of 
gerunds, but adds no further support to it. 
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DP whose complement is an NP whose complement is an AspP from being an article or 


a demonstrative pronoun. 


(1) a. his reading the book (high) b. the reading of the book (low) 


DP DP 
Lee ee 
DP D’ D NP 
D NP the N VP 
a NE n 
s N AspP -ing V DP 
-ing Asp vP read ofthe book 
Se. 
(PRO) v 
P di 
v VP 
ANR 
V DP 


read the book 


case 


Contrary to what the FNT seems to promise, then, the morphosyntactic properties ofa 
nominalization cannot be fixed just by locating its nominal head in a universal hierarchy 
of verbal functional categories, or even in a language-specific one. In that approach to 
mixed categories, it seems that the functional content that a given nominalizing head 
may combine with must be specified on an item-specific basis. But not just any arbitrary 
mixed category is possible. Consider the awesome unused power unleashed by the FNT. 
If functional N heads can convert AspPs into NPs in the syntax, as in (1a), why aren't 
there such things as Q heads with vP complements (*[some [he read it],p Joe), let alone 
multiple verbalizing and nominalizing syntactic heads interspersed to generate phrases 
in which layers of verbal and nominal structure alternate in various combinations? 


é Some of the overgeneration could be curbed by by eliminating the DP layer, or by eliminating the NP layer 
and having D select for AspP directly. But these projections cannot be struck from (1) because their heads 
are essential to the analysis. The D head serves as the site of the structural subject, and the N head houses 
the nominalizer -ing. Neither of these elements can be accommodated in the Asp head, for that is required 
for the aspectual auxiliary have. 
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The empirical problem of overgeneration is a direct result of the theoretical approach 
behind the FNT-style analysis. The derivation of gerunds in (1a) involves syntactic affix- 
ation of -ing to the phrasal projection AspP.’ A lexicalist perspective rules out affixation 
to phrases. It dictates an entirely different kind of derivation, in which the gerund suffix 
-ing is added to verbs in the morphology to build words (e.g. reading), which are then 
inserted in terminal nodes in the syntax. On this view, a gerund phrase is the syntactic 
projection of a gerund, not of a determiner as in (1a). On that basis we can build a simple 
and restrictive theory of nominalizations that explains all the data in Table 1. 

The key idea is that gerunds are participles, and that participial suffixes, -ing included, 
are Infl heads that differ from finite and infinitive Infl heads in that they bear a Case 
feature. The extended projection of a gerund is then an IP with a Case feature, which 
needs to be checked (or, from a non-lexicalist perpective, valued) in the syntax. The Case 
feature restricts participial phrases to two syntactic functions: arguments — gerunds - in 
positions where their value for Case can be checked by a predicate, and participial modi- 
fiers in positions where their value for Case can be checked by head-modifier agreement. 

Lexicalism excludes not only FNT-style analyses of gerunds, but every kind of syntac- 
tic affixation to phrasal categories. This means that no syntactic process can have the 
effect of changing the category of a word. That holds for all types of nominalization: 
event nominalizations, result nominalizations, and agent nominalizations. All “mixed 
categories” must then arise from morphological specifications of lexical heads, rather 
than from syntactic embedding as in (1). In §3 I support this more general prediction by 
showing that transitive agent nouns do not have an embedded vP projection and that 
their verbal properties come from a Tense/Aspect feature on the agent suffix. 

I assume that a phrasal constituent is a projection of its head, which inherits its cat- 
egory (Noun, Verb, etc.), its inflectional features (such as Aspect and Case), and its the- 
matic roles (Agent, Patient, Instrument, Event, etc.).? Mixed categories are verbs, nouns, 
and adjectives that have an extra phi-feature. Their extended projections behave like 
extended projections of ordinary verbs, nouns, and adjectives, modulo the properties en- 
forced by that feature content. The language-specific syntax of gerunds is determined by 
their Case feature. A gerund that can bear any Case projects a phrase with the distribu- 
tion of a DP. A gerund that has a partially specified Case feature projects a phrase that is 
restricted to positions compatible with the specified values of the feature. For example, 
Finnish gerunds are restricted to internal argument positions (section 2.2). Similarly, the 
verbal properties of transitive agent nouns are due to a Tense/Aspect feature assigned 
to these nouns by the agent affix that forms them. This feature may likewise be lexically 
unvalued and specified by additional aspectual morphology (as in Northern Paiute), or 
inherently specified on the agent noun affix itself (as in Sanskrit and Sakha), see 3.3. Since 
the mixed categories under lexicalist assumptions are projected from a single head, we 
correctly predict the absence of mixed categories in which verbal and nominal structure 


7 A similar earlier proposal is Yoon (1996). 

8 Eg. [-er] = APAxAe[P(e) ^ Agent(e,x)] (the set of human individuals that are the Agent of some event), 
[-ee] = APAxAe[P(e) ^ human(x) ^ Undergoer(e.x)] (the set of human individuals that are the Undergoer 
of some event). 
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is alternately layered in weird combinations, of vPs that function as DPs, and of the other 
abovementioned monstrosities. 

A theoretical gain is that we need not divide nominalizations into a syntactic type and 
a lexical type, as in standard lexicalist analyses. Once gerunds are recognized as IPs, we 
can maintain that all nominalizations are derived morphologically in the lexicon. This 
can be done either in a realizational morphology of the type pioneered by Anderson 
(1992), or in a morpheme-based one such as the minimalist morphology of Wunderlich 
(1996a). It remains to be seen whether the analysis can be recast in a DM-friendly syntax- 
based format. What is clear is that it does not follow from any theory that countenances 
structures like (1). To that extent at least, its empirical success constitutes new empirical 
support for lexicalism. 

I begin in §2 with “high” event nominalizations. I show that the lexicalist approach 
correctly predicts the syntax of English and Finnish gerund phrases, including aspects 
that go unexplained in FNT analyses, and that it curbs the typology in a good way. In 
§3 I apply the same idea to agent nominals, and support the resulting analyses with data 
from Vedic Sanskrit and Finnish that is new to the theoretical literature. 


2 Gerunds 


2.1 English gerunds 


Gerunds and participles are formally identical in English (Pullum 1991; Yoon 1996; Hud- 
dleston & Pullum 2002; Blevins 2003). For example, they are the only verb forms that 
overtly distinguish perfect aspect but not progressive or past tense. Given the modest 
morphology of English this identity might be dismissed as an accident, but the testimony 
of richly inflected languages, such as Finnish (2b), Classical Greek (2c), Sanskrit (2d), and 
Latin (2e) leaves no doubt that participles are systematically used in two functions: ad- 
jectivally as modifiers and nominally as arguments. 


(2 a. English -ing participle 
i Modifier: I saw Bill reading the book. (= I saw Bill.) 
ii. Argument: I hated Bill's reading the book. (= I hated Bill.) 
b. Finnish participles 

i Modifier: 
Muist-i-n hunaja-a ` syó-vá-n ` karhu-n. 
remember-pst-1sc honey-PRTC eat-PTC-GEN bear-Acc 
‘I remembered the/a bear (that was) eating honey: 

ii. Argument: 
Muist-i-n karhu-n syó-và-n hunaja-a. 
remember-PnRTC-1sG bear-GEN eat-PTC-GEN honey-PRTC 


‘I remembered that the/a bear ate honey: 
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. Classical Greek participles 


i. 


ii. 


Modifier: 

tón adikoü-nt-a Phílippo-n apéktein-a 
the-Acc act-unjustly-prrc-acc Philip-Acc kill-Aon-1sc 
‘I killed the unjustly acting Philip? 

Argument: 

adikoü-nt-a Phílippo-n eksélenk-s-a 
act-unjustly-PRTC-AcC Philip-Acc prove-AoR-1sc 


'I proved that Philip acted unjustly? 


. Sanskrit participles 


i. 


ii. 


Modifier: 

rajan-am a-ga-ta-m $r-no-mi 
king-Acc to-go-PRTC-ACC hear-PRES-1sG 
‘I hear the king (who has) arrived? 
Argument: 

rajan-am a-ga-ta-m $r-no-mi 
king-Acc to-go-PRTC-ACC hear-PRES-1sG 
‘I hear that the king has arrived: 


. Latin participles 


i. 


ii. 


Modifier: 
Hannibal vic-tu-s ad Antiochu-m ` confug-i-t 
Hannibal.NoM defeat-PRTC.MASC-NOM to Antiochus-Acc flee-PERF-3sG 


"Defeated, Hannibal took refuge with Antiochus: 


Argument: 

Hannibal vic-tu-s Romano-s metu 
Hannibal.NoM defeat-PRTC.MAsC-NOM Roman.ACc.PL fear.ABL 
libera-vi-t 


free-PERF-3SG 


‘Hannibal’s being defeated freed the Romans from fear. 


Traditional grammars of these languages treat participles as verb forms which are in- 
flected for Case, for good reasons. Participles distinguish the verbal categories of voice 
and tense/aspect, and they are formed off the same tense/aspect stems as the finite verbs. 
They supply the periphrastic forms that complete gaps in inflectional paradigms. They 
assign the same cases to their objects as the corresponding finite verbs and infinitives 
do. They are modified by adverbs, not by adjectives. They select for the same prefixes as 
the corresponding finite verbs and infinitives, with the same (often idiosyncratic) mean- 
ings. Those languages that disallow noun+verb compounds (such as classical Greek and 
Sanskrit) also disallow noun+participle compounds. As I show below, participles have 
structural subjects. 
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So there must be some property that distinguishes participles from finite verbs and 
infinitives, and which supports the double function of participles as nominal arguments 
and adjectival modifiers. The obvious candidate is Case. Suppose then that participial 
formatives are Infl heads that need Case. On lexicalist assumptions, they are affixed in 
the morphology to a verb to make a participle, which is then inflected for case if the 
language has case morphology, and enters the syntax with a specified Case feature that 
- like any Case feature - must be checked in the syntax. In a language that lacks case 
morphology, such as English, the participle remains unvalued for Case, and projects an 
IP with a Case feature that must be valued in the syntax. Both “checking” and “valuing” 
can be formalized as identical operations of feature unification, or as optimal matching 
in OT Correspondence Theory. 

As an illustration consider first the derivation of gerunds in English. Prescinding from 
vP, AspP, VoiceP, and other possible functional projections, their syntactic structure is 
as in (3). 


(3 ) IP [uCase] 


quU PLA 


DPicen] I’ 


Se. |, EUN 


The man's  Infljucase] VP 


reading it 


Inflj;cas;] combines with V in the same way as Tensed Infl does. How this happens de- 
pends on the model of grammar. If we assume both lexicalist syntax and lexicalist morph- 
ology, the case-needing Infl -ing is suffixed to V in the morphology to form a participle, 
and the participle then projects a case-needing IP in the syntax, where the Case feature is 
valued. In argumental participles (gerunds), it is valued by the governing Case-assigner, 
and in participial modifiers it is valued by agreement with the nominal they modify. 

If we assume minimalist syntax, we can comply with lexical morphology by using 
spanning (Svenonius 2016), which allows the lexically generated participle to be inserted 
under the two corresponding syntactic terminal nodes. In DM, -ing would be a syntactic 
terminal that is postsyntactically Lowered onto V. Thus, the idea that gerunds are case- 
needing IPs can probably be implemented in any grammatical architecture. However, 
in non-lexicalist frameworks this analysis is merely motivated on empirical grounds. 
In lexicalist frameworks that prohibit affixation to phrases it is required on principled 
grounds as well. 

In languages where participles are morphologically inflected for Case, such as (2b-e), 
participles project an IP that bears a specified Case value that must be checked in the 
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syntax. The lexicalist approach now makes an interesting prediction: in such languages, 
gerunds may be morphologically restricted to particular Case features, which restrict 
their syntactic distribution to contexts compatible with those features. This prediction 
is confirmed in Finnish. Finnish gerunds bear an oblique Case - glossed as Genitive in 
(2b.ii) — which confines them to internal argument positions (section 2.2).? 


(4) IP [Gen] 


m MUN 


DPicen] I 


P Do NP c 


karhun(gey]  InfliGen] VP 


V DP 


syövän hunajaaypart] 


Returning to English, the analysis of participial clauses as IPs that have the single 
special property of needing structural Case explains at a stroke which clausal and nom- 
inal properties they have and which ones they lack. To start with the latter, it explains 
why gerund phrases, unlike DPs, have no articles, quantifiers, or numerals, why they 
cannot be modified by adjectives and relative clauses, why their head cannot be genitive 
or plural. 


(5) a. * The/a/each compiling the corpus took over a year. 
b. *Both/every compiling corporas took over a year. 

* His two compilings corpora each took over a year. 

* His careful compiling the corpus was a turning point. 


* His editing texts that is funded will take a year. 


mo mo 


* His compiling corpora’s results were dramatic. 


The missing categories are just the ones that would originate in a DP. 

As for the nominal properties that gerunds do have, they are accurately covered by the 
generalization that gerunds appear in Case positions. They function as subjects, objects, 
and predicates, as objects of prepositions (6e,f,g), and as objects of a small set of transitive 
adjectives (6f,g), all diagnosed as Case positions by the fact that full-fledged DPs occur 
in them. 


(6) a. [Bill'sleaving Paris ] was unexpected. 
b. Iregret [ Bill's leaving Paris ]. 
c. The problem is [ Bill's leaving Paris ]. 


? There is no agreement relation between the genitive subject and the gerund in (4). 
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d. Because of Bill's leaving Paris we'll be hiring new personnel. 
e. We are worried about Bill's leaving Paris. 
f. This event is worth my visiting Paris. 


g. It's no good my playing this sort of game.!? 


This does not mean that all transitive verbs take gerund complements. Particular verbs 
can select for whether they take gerunds, that-clauses, or infinitive complements, just 
as they can select for whether they take DPs: 


(7) a. "*Isaid Bill's leaving Paris. 


b. Isaidit/something/several things. 


What the analysis correctly predicts is that gerunds, unlike that-clauses and infinitives, 
appear only in Case positions: 
(8) a. i *IhopeBills leaving Paris. 
ii. * I hope it. 
iii. I hope that Bill is leaving Paris. 
iv. Ihopeto leave Paris. 
b. i *Itis rumored Bill's leaving Paris. 
ii. * The proposal is rumored. 
iii It is rumored that Bill is leaving Paris. 
iv. It is rumored to be happening. 
c. i. *It seemed / was expected Bill's leaving Paris. 
ii. * It seemed / was expected this event. 
iii. It seemed / was expected that Bill would leave Paris. 


iv. Itseemed / was expected to happen. 


A further consequence is that the subjects of gerunds are IP specifiers. If overt, they 
are Genitive or Accusative," just as the subject of a finite IP is Nominative, and the 
overt subject of an infinitive requires a Case-assigning for. Crucially, they are true struc- 
tural subjects analogous to subjects of finite clauses, not necessarily "agents" as in B&V's 
Table 1a, nor “possessors” with their varied functions as in derived nominals. This predic- 
tion is confirmed by three generalizations. Unlike genitive specifiers of nouns (including 
derived nominals), but like structural subjects of finite clauses, the specifiers of gerunds 
can be expletives: 


(9) a. It(s) seeming to you that you dreamt is not evidence of it(s) being the case 
that you dreamt. 


b. *It(s) appearance to you that you dreamt is not evidence of it(s) truth that you 
dreamt. 


10 Cf. It’s no good this sort of game. (Dickens, Our Mutual Friend) 
H On Acc+ing gerunds see the brief and inconclusive remarks in 82.3 below. 
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Like structural subjects of finite clauses, they are subject to control (Huddleston & Pul- 
lum 2002: 1190): 


(10) a. Mary remembered locking the door. [the rememberer is the locker] 


b. Mary remembered the/a locking of the door. [the rememberer might not be 
the locker] 


like structural subjects of finite clauses, and unlike genitive agents of nominals, they 
cannot be paraphrased with of or by: 


(11) a. the Persians’ quick run = the quick run of/by the Persians 
b. the Persians’ quick running = the quick running of/by the Persians 
c. the Persians’ quickly running # “quickly running of/by the Persians 


d. the Persians’ quickly attacking the Greeks + "quickly attacking the Greeks 
of/by the Persians 


e. the Persians quickly attacked the Greeks 7 "of/by the Persians quickly 
attacked the Greeks 


To summarize: the analysis of gerunds as IPs with Case explains the cross-linguisti- 
cally common convergence of nominal and adjectival functions in a single morphological 
class of verbal forms. By not positing any DP or NP structure over the IP it avoids the 
overgeneration problem that FNT-type analyses face. It excludes the possibility of mul- 
tiple alternating verbalizing and nominalizing syntactic heads to which the FNT opens 
the theoretical door, and gets rid of the constraint that heads of DPs whose complements 
are NPs whose complements are verbal projections may not be articles or demonstrative 
pronouns. It provides the basis for a uniform structure for all DPs, and for a uniform lex- 
ical derivation of all nominalizations. It correctly predicts that gerunds and participles 
have subjects — specifiers of Infl that are structural counterparts to the subjects of finite 
clauses. What is important is that the analysis is not motivated merely by these empir- 
ical arguments; it is a consequence of lexicalism, and, if correct, supports the lexicalist 
organization of grammar. 

The question arises whether there might be a CP layer above the IP, headed by a 
null complementizer. This additional structure is not justifiable for English, because the 
distribution of gerunds differs from that of any type of CP. First, gerunds need Case, 
whereas CPs do not (Vergnaud 1977). Secondly, gerunds are permitted in clause-medial 
position, while that-clauses and other CP clauses must extrapose.”” 

That gerund phrases are full IPs with a structural subject, that they bear Case, and 
that, unlike derived nominals, they have no DP or NP projection, and in particular no 
possessor-type Specifier, makes many additional predictions that are testable in morpho- 
logically richer languages. They turns out to be abundantly supported, as demonstrated 
for Finnish in the next section. 


12 However, the case marking of participles in inflected languages could be considered as a kind of comple- 
mentizer, as conjectured for the inherent case affix -n on Finnish gerunds in 82.2. 
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2.2 Finnish gerunds are IPs 


Finnish participial propositional complement clauses are the closest functional coun- 
terparts of English gerunds, and I will call them gerunds here. They are not DPs with 
possessors but IPs with true structural subjects. Their Case is inherently marked by the 
oblique suffix -n, arguably functioning as a complementizer, which restricts them to in- 
ternal argument positions. This illustrates how the typology of gerunds emerges from 
variation in what cases they can bear.’ 

Unlike English gerunds, Finnish gerunds are never external arguments. Thus they can 
be objects of transitive verbs such as say, think, want, prove, remember and hear, and 
subjects of presentational intransitives like appear and become evident, but they cannot 
be subjects of such predicates as be obvious, prove, and mean.'* 


(12) a. Selvis-i Matin | ampu-nee-n karhu-n. 
become-clear-PsT.3sc Matti-GEN shoot-PERF.PRTC-GEN bear-ACCgrn 
‘Tt became clear that Matti had shot the/a bear. 


b. *Matin ` ampu-nee-n karhu-n ` suututt-i 
Matti-GEN shoot-PERF.PRTC-GEN bear-ACCggy anger-PsT.3sG 
Liisa-a. 

Liisa-PART 
‘That Matti had shot the/a bear angered Liisa’ 


This distribution suggests that the ending -n that participles bear in their gerundial func- 
tion, glossed “GEN” in (12), marks an object Case that is compatible with internal argu- 
ments but not with external arguments. Historically, it is probably the old dative ending, 
which has fallen together phonologically with the genitive, but persists as a morphosyn- 
tactically distinct type of genitive which (unlike the structural genitive) cannot function 
as a subject (Kiparsky In press). 

As shown in (13), Finnish gerunds behave more like bare finite CP clauses with ettd- 
(that-) than like DPs, whether nominal DPs (13c) or pronoun-headed finite clauses with 
se että- (it that-) (13d). 


(13) a. Huomas-i-n / ymmárrá-n /luule-n /otaksu-n  tilante-en 
notice-lsc / understand-1sc / think-1sc / assume-1sc situation-GEN 
ole-va-n hankala-n. 
be-PRTC-GEN difficult-GEN 
‘I noticed / understand / think / assume that the situation is difficult. [lit. ‘the 
situation's being difficult] 


13 The data and analysis of Finnish gerunds presented in this section is condensed from my treatment of 
Finnish nonfinite complementation in Kiparsky (In press), to which I refer the reader for the details. 

14 Tn the glosses, ACCgEn and ACCyom both refer to morphosyntactic Accusative structural case. The subscripts 
show three different morphological case realizations of this morphosyntactic Case. They will become im- 
portant shortly, but for now the reader may ignore them. 
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b. Sen huomat-tiin — /ymmárre-táàn /luul-laan / otaksu-taan 
It-GEN notice-PAss-PsT / understand-Pass / think-PAss / assume-PAss 
ole-va-n hankala-n. 
be-PRTC-GEN difficult-GEN 


‘It is noticed / understood / thought / assumed to be difficult: 


c. Huomas-in /ymmärrä-n /*luule-n /*otaksu-n hane-t / 
notice-PsT-1sc / understand-1sc / think-1sc / assume-1sc him-ACCacc / 
se-n seika-n. 


that-AcCggy thing-ACCern 


, 


‘I noticed / understand / think / assume him / this point (fact) 


d. Huomas-i-n /ymmárrà-n /*luule-n /*otaksu-n Sen. ettà 
notice-PsT-1sc / understand-1sc / think-1sc / assume-1sc it-ACCggy that 
tilanne on hankala. 


situation.NoM is difficult 


‘I noticed / understand / think / assume that the situation is difficult? 


The distinction between verbs that allow DP objects (huomat- ‘notice’ and ymmártá- in 
(13)) and verbs that do not allow DP objects (luule- ‘think’ and otaksu- ‘assume’ in (13)) 
is correlated with factivity, but the correlation is not exact and my argument does not 
depend on it. 

Since gerunds are not DPs but case-marked IPs, their genitive subjects behave like 
structural subjects and not like genitive specifiers of DPs. This is shown by five argu- 
ments. 

The first argument that the genitive specifier of gerunds is a grammatical subject is 
that it gets assigned exactly the same Th-roles as the subjects of the corresponding finite 
clause, not the diverse range of interpretations that “possessors” of derived nominals 
receive (see above under Table 1). So Matin in (12a) picks out the agent of the shooting 
event, whereas the specifier Matin of the derived nominal (14) could be, among other 
things, the organizer or theme of the rescue. 


(14) Muista-n Mati-n X pelastukse-n. 
remember.PRES-1sc Matti-GEN rescue-NOM-ACC 
‘I remember Matti's rescue: 


The second argument comes from extraction. The subjects of gerunds can be extracted 
as readily as objects: 


(15 a. Kene-n váit-i-t ampu-nee-n  hàn-tàá? 
who-GEN claim-PsT-2sc shoot-PFP-GEN he-PART 
‘who did you claim shot at him?’ 
b. Ke-ta vait-i-t hàne-n ampu-nee-n? 
who-PART claim-PsT-2sc he-GEN shoot-PFP-GEN 


‘who did you claim he shot at?’ 


323 


Paul Kiparsky 


But possessors cannot be extracted (16a), and neither can genitive specifiers of tense- 
less nonfinite complements such as the third infinitive (16b) and the second infinitive 
(16c) (the Left Branch Condition, Ross 1967: 127). 


(16) a. *Kene-nj_ váit-i-t ammu-tu-n ti karhu-n / että 
who-GEN claim-psT-2sc shoot-PERF.PRTC-GEN bear-GEN / that 
ammu-ttin ti karhu? 


shoot-PAss.PST bear.ACCyom 
‘Whose bear did you claim (that) was shot?’ 


b. *Kene-n; väit-i-t ej ampu-ma-n karhu-n paina-nee-n 
who-GEN claim-Psr-2sG ` shoot-31Nr-GEN bear-GEN weigh-PERF.PRTC-GEN 
500 kilo-a? 
500 kg-PART 


"Ihe bear shot by whom did you claim weighed 500 kg?’ 

c. *Kene-n; itk-i-t e; ampu-e-ssa karhu-n? 
who-GEN claim-Psr-2sG — shoot-2INF-INESs bear-ACCgpn 
"Who did you weep while he shot the/a bear?' 


A third diagnostic which shows that gerunds have structural subjects and not posses- 
sors is that they do not undergo possessor agreement. Nouns and infinitives agree with 
their genitive specifiers, as exemplified for nouns in (17a), for the second infinitive in 
(17b), and for the third infinitive in (17c). 


(17) a. (Minu-nj) karhu-ni; paino-i 500 kilo-a 

(My-GEN) bear.NoM-1sc weigh-pst-3sc 500.Acc kg-PART 
‘My bear weighed 500 kilograms’ 

b. Matti itk-i (minu-n;) ampu-e-ssa-ni; karhu-n 
Matti.NOM weep-PsT.3sc (my-GEN) shoot-21Nr-1sc bear-ACCgrn 
‘Matti wept as I shot the/a bear. 

c. (minu-nj) ampu-ma-ni; karhu 
(my-GEN) shoot-31Nr-1sc bear.NoM 
‘the/a bear I shot? 


But gerunds do not possessor-agree with their subjects, as we can see in (18a,b) and (with 
a raised subject) in (18c). 


(18) a. Matti ties-i minu-n; ampu-nee-n 
Matti.NOM know-PsT.3sc me-GEN shoot-PRF.PRT-GEN 
(*ampu-nee-ni;) karhu-n 


(shoot-PRF.PRT(-GEN)-1sG) bear-ACCgzn 
‘Matti knows that I've shot the/a bear? 
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b. Selvis-i hane-nj ampu-nee-n 
become clear-Psr-3sc he-GEN (shoot-PRF.PRT(-GEN)) 
(*ampu-nee-nsa;) karhu-n 
(shoot-PERF.PRTC(-GEN)-3P) bear-ACCgpn 
‘it became clear that he had shot the/a bear’ 
c. Nayta-t; ampu-nee-n (“ampu-nee-sij) karhu-n 
seem-2sc shoot-PRF.PRT-GEN (shoot-PERF.PRTC-(GEN)-2SG) bear-ACCgzn 


‘you seem to have shot the/a bear: 


Of course the subjects of gerunds cannot subject-predicate agree with the gerunds like 
nominative subjects of finite clauses agree with the finite verb, for genitive subjects never 
subject-predicate agree in Finnish. 

The fourth argument that gerunds have structural subjects comes from the distribution 
of accusative Case morphology. Descriptively, Finnish morphosyntactic Case is realized 
as morphological case as follows.” 


(19) a. The subject of a participial clause is always Genitive. 


b. The object of a participial clause can be morphosyntactic Accusative or 
Partitive. Partitive is assigned to objects under the same conditions as in 
finite clauses: 


i. Objects under overt or implicit negation are Partitive. 
ii. Objects of certain predicates (such as love and touch) are Partitive.!ó 


iii. Otherwise objects are Accusative. 


Morphosyntactic Partitive is always realized as morphological partitive. And now comes 
the essential and trickiest part. Morphosyntactic Accusative is realized by three morpho- 
logical cases: 


(20) a. as morphological accusative on personal pronouns, 


otherwise as morphological genitive if the object is plural, or if the clause has 
a subject with structural case (this last condition is called JAHNsson’s RULE), 


c. otherwise as morphological nominative. 


Clause types that lack subjects with structural case for purposes of Jahnsson's Rule in- 
clude imperatives, bare infinitives (“to see Naples and to die"), passives (which in Finnish 
do not involve “promotion” of the object), and clauses with “quirky case” subjects. 
Since the argument to be presented below uses Jahnsson’s Rule as a diagnostic for the 
presence or absence of a structural subject, I will gloss the examples in such a way that 
the reader can see whether Jahnsson’s Rule has taken effect in them. This means glossing 
not only morphosyntactic Accusative Case, but whether morphosyntactic Accusative 


1 For details see Kiparsky 2001; a sophisticated OT treatment of the variation is developed by Anttila & Kim 
(2016). 

16 The class of partitive-assigning predicates is often called “telic” (e.g. Kratzer 2002). This is not quite correct; 
for an attempt at a more accurate formulation see Kiparsky 2005a. 
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Case is realized as morphological accusative case or nominative case. So I will mark 
morphosyntactic Case by the main gloss and morphological case with a subscript on it. 
For example, in (21) both objects bear morphosyntactic Accusative Case, realized in (21a) 
as morphological genitive case and in (21b) as morphological nominative case. 


(21) a. Matti ampu-i karhu-n 
Matti.Nom shoot-PsT(3sc) bear-ACCcrn 
‘Matti shot the/a bear. 


b. ammu karhu! 
shoot-IMPER bear-ACCyom 


‘shoot the bear!’ 


Through the rest of the text in this section I use capitalization to distinguish morphosyn- 
tactic Case (such as Accusative) from morphological case (nominative, accusative, etc.). 
At last we are ready for the argument. Nonfinite complement clauses are translucent 
to the triggering of Accusative and Partitive Case and to the realization of Accusative 
case as genitive or nominative, in the sense that (19) and (20) can be conditioned either 
within the gerund clause or in the larger domain of the higher clause with its gerund 
complement. So in (22a) the object of the lower clause, which contains no negation, 
can have either Accusative Case (realized as morphological genitive case by (20a)), or 
Partitive Case from the negated main clause by (19b.ii). In (22b) the morphosyntactic 
Accusative Case on the object of the gerund is realized either as morphological genitive 
case because the main clause has a subject, or as morphological nominative case, be- 
cause the participle, being passive, is subjectless (Jahnsson’s Rule, (20b))." In (22c) the 
morphosyntactic Accusative Case on the object can only be realized as morphological 
nominative case because both the matrix verb and the participle are subjectless. 


(22) a. En tien-nyt heidá-n ampu-nee-n / 
Not-1sc know-PERF.PTC they-GEN shoot-PERF.PRTC-GEN / 


ampu-va-n karhu-n ` /karhu-a 
shoot-PRS.PRTC-GEN bear-ACCgpn / bear-PART 


‘I didn't know that they had shot / were (would be) shooting the/a bear: 


b. Ties-i-n metsá-ssá ammu-tu-n karhu-n / karhu 
know-PsT-1sc forest-ILLAT shoot-PASS.PRTC-ACCogn bear-ACCggn / bear.ACCyom 


‘I knew a bear to have been shot in the forest? 


c. Eilen ilmen-i ammu-tu-n "karhu-n  /karhu 
Yesterday turn.out-PsT.3sc shoot-PASS.PRTC-GEN bear-ACCery / bear.ACCyom 


‘It turned out yesterday that a bear was shot: 


17 The variation between case governed locally within the subordinate clause and in the larger domain that in- 
cludes the main clause is sensitive to as yet poorly understood semantic, stylistic and discourse factors. The 
distribution of the Partitive in particular is affected by factivity and the scope of negation (“Lausetyyppien 
kásittely transformaatioteoriassa [The treatment of sentence Types in Transformational grammar]" 1970: 
31, Hakulinen & Karlsson 1979: 365). For example, in (22a), the Partitive registers surprise or skepticism, 
and in (22b) the Accusative (realized as nominative) is likely to be interpreted factively. 
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The crucial case is (23), where the morphosyntactic Accusative Case of the object may be 
realized as morphological genitive case. Since the matrix verb is subjectless, the object’s 
realization as morphological genitive case must be licensed by the subject of the gerund, 
Matin. Therefore the subject has structural Case. 


(23) Ilmen-i Mati-n | ampu-nee-n karhu-n / karhu. 
turn.out-PsT.3sc Matti-GEN shoot-PERF.PRTC-GEN bear-ACCgpgy / bear.ACCyom 


‘Tt turned out that Matti shot the/a bear. 


This completes the fourth argument that the genitive subject of gerunds is a structural 
subject. 

In contrast, the fact that “quirky” genitive subjects induce the nominative form of the 
object tells us, by Jahnsson’s Rule, that they are non-structural: 


(24) a. Hanen pitàà osta-a auto. 
He-GEN must-3sc buy-lINF car.ACCyom 


‘He has to buy the/a car: 
b. Hànen on helppo nosta-a tama sákki. 
He-cEN be-3sc easy — lift-1rr this.acCyom sack.ACCyom 


‘It is easy for him to lift this sack’ 


This is as expected, since they are not assigned structurally but idiosyncratically by par- 
ticular predicates. 

A fifth argument that gerunds have structural subjects is that they can have a generic 
null subject proar. In Finnish proarp can be a subject (Hakulinen & Karttunen 1973) 
but it cannot be a possessor: contrast (25a) and (25b). So, the fact that gerunds can 
have a generic prog,» subject, as seen in (25c), is another datum in support of the claim 
that gerunds have structural subjects and not possessors. Moreover, gerunds can be 
subjectless under the same conditions as subjects of finite clauses. For example, gerunds 
can have the impersonal passive form, see (25d). 


(25) a. Siellá voi tanssi-a. 
there pro can-3sc dance-1iNF 
‘One can dance there: 


b. *On mukava katsel-la Q valokuv-i-a. 
be-3sc nice ` look.at-1tNr pro photo-PL.PART 


*'It's nice to to look at one's photos. (OK without Ø: ‘It’s nice to look at 
photos: 


c. Siellà vàite-t-ààn Ø voi-va-n tanssi-a. 
there claim-PsT.PAsS pro can-PRES.PRTC-GEN dance-lINF 
‘It is claimed that one can dance there’ 

d. Siellà vaite-tt-iin voi-ta-va-n tanssi-a. 
there claim-PsT.PASS can-PASS-PRES.PRTC-GEN dance-1INF 


‘It was claimed that there is dancing there? 
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I conclude that Finnish gerunds are IPs like English gerunds, albeit with a different 
syntactic distribution due to their oblique Case specification. 


2.3 Desultory remarks on Acc-ing 


The English “Acc-ing” construction differs in many ways from the “Poss-ing” gerund 
considered here so far. I have no serious analysis of it to offer. Its behavior resembles Acc- 
Inf (“ECM”) constructions in some ways. First, unlike gerunds with genitive subjects, it 
is degraded by intervening adverbs, extraposition, and fronting, under roughly the same 
conditions as nominal objects (“Quantification, Events, and Gerunds’; Pires 2006): 


(26) a. We anticipated ("?eagerly) him leaving Paris. 


b. (We anticipated his resignation, but) *?him/his leaving Paris we did not 
anticipate. 


This is the same pattern as: 


(27) a. We believe ("?strongly) him to have told the truth. 


b. (We believed him to have been mistreated, but) "?him to be telling the truth 
we did not believe. 


Acc-Inf gerunds allow extraction, like Acc-Inf complements and unlike Poss-ing ger- 
unds: 


(28) a. Which city do you remember him/*his describing? (Portner 1995: 637, citing 
L. Horn) 


b. Who do you resent Bill/*Bill’s hitting? (Williams 1975: 263) 
c. Who/*whose do you resent hitting Bill? (cf. "Who do you resent (it) that hit 
Bill?) 
d. Who do you believe to be telling the truth? 
e. What do you believe him to be saying? 
Another frequently noted difference between the constructions is that the genitive sub- 


ject of gerunds is preferentially human, and cannot be expletive there at all, whereas the 
accusative is unrestricted in this respect, again like Acc-Inf subjects. 


(29) a. There (there's) being no objection, the proposal is approved. 

b. ?Iimagined the water's being 30 feet deep. 
Accusative subjects of gerunds do not seem to be getting their case from the main verb, 
since they can appear in gerunds that function as subjects. Possibly the accusative case 


assigner is a null preposition or complementizer, an analog of the overt for of for-to 
infinitives. 
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3 Agent nominals, transitive and intransitive 


Like the FNT, my alternative theory of nominalizations is in principle applicable to every 
type of nominalization, including agent nominalizations and result nominalizations. The 
mixed category that most gravely challenges analyses of agent nominals is transitive 
agent nouns, which function as nominals except for assigning structural case to their 
objects and allowing some adverbial modifiers. I will make a case that, just as gerunds are 
categorially verbal at all levels of the syntax and their noun-like behavior is entirely due 
to a nominal Case feature borne by their Infl head, such transitive agent nominalizations 
are categorially nominal at all levels of the syntax and their verb-like behavior is entirely 
due to a verbal feature borne by their nominalizing head, namely Aspect. Gerunds and 
transitive nominalizations thus prove to be duals in a sense - respectively verbs with 
Case and nouns with Aspect. 

I show that this idea predicts the distinction between transitive and intransitive agent 
nouns, whereas the functional properties of nominalizations neither correlate with each 
other as the FNT predicts, nor match the height of their nominalizing heads in syntax 
or word structure. In Vedic Sanskrit (sections 3.1-3.3) and in Finnish (2.2), high agent 
nominalizations do not assign structural case if they lack Tense/Aspect features, and even 
low agent nominalizations do assign structural case if they have Tense/Aspect features. 


3.1 Agent nominals and subject nominals 


In their illuminating study based on the FNT approach, Baker & Vinokurova (2009) pro- 
pose an analysis and typology of agent nominalizations similar to the one I have called 
into question for event nominalizations. They begin by noting an asymmetry between 
agent and event nominals. “High” event nominals like (1a) have no agent noun counter- 
part such as (30). 

B&V claim that this is a systematic gap, and propose to explain it on the basis of two 
key assumptions. First, agentive nominalizing morphology is added by a nominal head 
immediately above VP. Secondly, in some languages, such as English, structural case 
is assigned to objects by an active Voice/v head, whereas in other languages, structural 
case is assigned configurationally (dependent case).? Together, these assumptions rule 
out transitive agent-denoting nominalizations, such as (30) *the reader the book. Instead, 
they require the structure (31). Here the agent nominalizer pre-empts the case-assigning 
active Voice morpheme in v that assigns structural case to objects in English, but (by 
hypothesis) has no case-assigning force itself. 


18 Tt is fair to ask why it is added there and not in a higher position. B&V hint that this is “a position apparently 
forced on it by the natural (iconic) semantic composition of the clause" (Baker & Vinokurova 2009: 521), 
but this remains to be justified. 

19 BEV equate Voice with v, following Kratzer (2004), but contra Alexiadou (2001); Alexiadou, Soare & Iorda- 
chioaia (2010); Harley (2012), among others. 
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(30) * the reader of the book 
DP 


read the book 


case 


V DP 


NET dm 


read ofthe book 
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The analysis further predicts that, since voice markers cannot attach to unaccusatives,?? 


such agent nouns cannot attach to unaccusative verbs. 

B&V then draw a distinction between agentive and non-agentive "agent" nominalizers 
- let's call the latter SUBJECT NOMINALIZERS. Subject nominalizers do assign structural 
case, and can be attached to unaccusative verbs. B&V (p. 547) analyze them as "nominal 
equivalents of an AsPECT head", in the sense in which agentive nominalizers like -er are 
nominal equivalents of a voice head. Their example is Giküyü -i, another example is 
Northern Paiute -di (Toosarvandani 2014). B&V propose the structure (32): 


(32) Subject nominalizers (high) 
NP; 


slaughter goats 


ACC case 


As an immediate challenge to the FNT in the domain of agent nominalizations, B&V 
note that otherwise low agent nominalizations unexpectedly assign structural case in 
some languages. For B&V, these languages must be special in that they assign structural 
case by a dependent case mechanism, whereas languages in which low agent nouns have 
oblique complements assign structural case by little v. The need to maintain two entirely 
distinct mechanisms of structural case assignment on the basis of evidence that cannot 
loom large in the learner’s experience would be another disappointing consequence of 
the FNT.?! We'll also see that B&V's analysis of agent nominals imposes a functional 


20 Tn fact an incorrect premise: unaccusative verbs passivize in numerous languages, including Finnish and 
Sanskrit (Kiparsky 2013). 

?! Levin, Preminger & Omer (2015) propose that all structural case can be assigned by dependent case, pro- 
vided that the algorithm is parametrized in certain ways. However, they do not touch on the case variation 
in objects of agent nominalizations, and the parametrization of structural case assignment that they pro- 
pose does not account for it, as far as I can tell. 
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overload on little v that makes the FNT’s various criteria for syntactic height mutually 
irreconcilable. 

In summary, B&V’s proposal generates the typology of agent nouns shown in Table 2. 
In the remainder of this section I show that the predicted correlations do not hold for 
agent nouns of Vedic Sanskrit and Finnish, and propose a much simpler alternative that 
does justice to the data. 


Table 2: Typology of agent nouns predicted by Baker & Vinokurova (2009). 


Agent Nominalizers (low, v) Subject Nominalizers (high, Asp) 
always agentive non-agentives OK 
no unaccusatives unaccusatives OK 
structural case only if dependent case structural case 
no adverbs adverbs OK 
no Aspect compatible with Aspect 
no Voice compatible with Voice 


3.2 Vedic agent nouns 


Vedic and Paninian Sanskrit has a large number of agent noun suffixes, which fall into 
two clearly demarcated types. A minimal pair that highlights the contrast are the two 
agent noun types in accented -tár-" and preaccenting -tar-"?? Agent nouns in accented 
-tár-" have genitive objects and get only adjective modifiers, never adverbs, e.g. (33a). 
Agent nouns in preaccenting -tar-" (boldfaced in (33)) regularly assign structural case 
to their objects and, can get certain aspectual adverb modifiers, such as punah ‘again’ in 


(33b). 
a. tvá-m hí satyá-m ..vid-má datar-am is-ám 
33 n h y d d S 
you-ACC PRT true-Acc ... know-1PL giver-Acc good.thing-PL.GEN 
^we know you as the true giver of good things: (RV. 8.46.2) 
b. ís-kar-ta víhruta-m pünah 
fixer-NOM wrong-Acc again 
‘the maker right again (of) what has gone wrong: (RV 8.1.12) 


Both suffixes are true nominalizers: they form nouns, not verbs. They have a complete 
nominal case and number inflection paradigm, take denominal derivational suffixes, such 


22 Their Indo-European provenance is guaranteed by Greek and Avestan cognates (Lowe 2014). The following 
exposition of their contrasting semantics, morphophology, and syntax draws on the generalizations and 
evidence in Kiparsky 2016, to which the reader is referred for details. 

23 Other agent nouns with verbal properties are attested in early Vedic include -i-V RV 9.61.20 jaghnir vrtram 
‘killer of Ytra’, -(i)snu-V RV 1.63.3 dhrsnür etán ‘bold against them’, -u-V AV 12.1.48 nidhanám titiksuh 
‘enduring poverty’, -2-V RV 1.1.4 yam yajfiám ... paribhár ási ‘the sacrifice that you embrace’. 
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as derived feminines, and can be compounded.” They allow adjectival modification (in 
addition to adverbial modification, in the case of -tar-"). These nominal properties are 
unsurprising for the noun-like -tár-" formations; that they hold also for the more verb- 
like -tar-" is documented in (34). 


(34) a. asim jétaram 
asu-m jé-tàr-am 
quick-Acc win-er-Acc 


‘the quick (Acc.) winner (Acc.)’ (RV 8.99.7) 
b. tásteva pstyamayi 

taks-tar iva prstya-àmay-ín 

carve-er.NOM like back-ache-ed.NoM 

"like a notalgic (Nom.) carpenter (Nom.)' (RV 1.105.18) 


Semantically both -tár-" and -tar-" are agent nominalizers, not subject nominaliz- 
ers: they are never added to non-agentive verbs or unaccusatives of any kind, and the 
meaning of the nominalization is canonically agentive.? So by these criteria both nom- 
inalizations are "low" in the sense of B&V, not Giküyü-type "high" nominalizations. 

The agent nominalizers -tar-" and -tár-" form a privative semantic opposition, missed 
in the modern philological literature but correctly delineated already by Pànini, whose 
description turns out to tally perfectly with the Vedic data. The unmarked member of 
the opposition is -tár-", which simply denotes agency (like English er). The marked 
member ^-tar-" has two additional meaning components: 


(35) a. -tar-" denotes agency in ONGOING TIME. 


b. ‘-tar-” denotes HABITUAL, PROFESSIONAL, Or EXPERT agency. 


The criteria of the FNT make contradictory predictions. Since both nominalizations 
are agentive, both should be structurally low “little v” heads. On the other hand, tar-V 
nominalizations, which have the verbal properties of assigning structural case and al- 
lowing adverbial modification, should be structurally high, while -tar-N nominalizations, 
which have strictly nominal properties, should be structurally low. Neither of these is 
the case. In fact, as far as the case and adverb properties are concerned, the structure is 
just the opposite of what is predicted: verbal -tar-" is low and nominal -tár-" is high. 
This is shown by four arguments (details in Kiparsky 2016). 

The first argument that verbal -tar-" is low and nominal -tár-" is high is their morpho- 
logical position in the word. -tar-" always follows the bare verbal root directly, without 


24 E.g. ksirá-hotar- /milk-offerer' (SBr.), and nesta-potarau ‘leader and purifier’ (TS.), co-compounds (Kiparsky 
2010b) denoting pairs of priests. 

?5 Thus, the following roots do not take either -tár-N and "-tar-V or any other agent suffixes for that matter: 
as ‘be’, ds ‘sit’, sī ‘lie’, sru ‘flow’, plu ‘float’, tras ‘tremble’, vyath ‘sway’, bhrams ‘fall’, svap ‘sleep’, ksudh ‘be 
hungry’, trs ‘be thirsty’, svid ‘sweat’, rdh ‘flourish’, ru(d)h ‘grow’, pyà ‘swell’, ris ‘sustain damage’, mr ‘die’, 
$am ‘become calm’, mad ‘get drunk’, mud ‘rejoice’, hrs ‘get excited’, dhrs ‘dare’, bhi ‘fear’, hid ‘be angry’, 
krudh ‘become angry’, grdh ‘be greedy’, ruc ‘shine’, $ubh ‘shine, be beautiful’, bhd ‘shine’ bhas ‘gleam’, dyut 
‘to strike’ (of lightning), pat ‘to fall’. 


V 


333 


Paul Kiparsky 


any other intervening suffix; it cannot be added to compound or prefixed bases. -tár-", 
on the contrary, can be separated from the root by verb-to-verb suffixes commonly ana- 
lyzed as Voice/v heads (causative, denominative, intensive, desiderative). It is affixed to 
the whole verb base, including the extended root plus any prefix that combines with it: 


(36) a. RV cod-ay-i-tr-i- ‘impeller (fem.)’ (caus. cod-áy-a-ti ‘impels’) 
b. TS pra-dap-ay-i-tar- ‘bestower’ (caus. pra-dap-ay-a-ti ‘bestows’), 
c. ni-dha-tár- ‘one who sets down’ (ní-dadha-ti ‘sets down’) 
The morphological data point to the respective constituent structures in (37): 


(37) a. Low: [ Prefix [ Root tar-" ] ] 
b. High: [ [ Prefix [ Root (Caus)... ] ] -tár-" ] 


The second argument that verbal -tar-" is low and nominal -tár-" is high comes from 
word accentuation. The morphological conditioning of accent placement provides a con- 
venient probe into the constituent structure of words. In Vedic and Paninian Sanskrit, 
the accentuation of words is computed cyclically from the accentual properties of the 
morphemes from which they are composed. Morphemes may be accented or unaccented, 
and at the word level, all accents but the first in a word are erased (Kiparsky 2010a). Both 
of our agent suffixes (like the majority of derivational suffixes) belong to the accentually 
DOMINANT type: they erase the accent off the bases to which they are added. The crucial 
fact for present purposes is that dominant affixes exercise this erasing effect exactly on 
the stems to which they are added, no more and no less. Thanks to this property we can 
use accentuation to diagnose constituent structure in morphologically complex words. 

The empirical generalization is that prefixes always prevail over low (bare-root) suf- 
fixes, including -tar-", whereas high suffixes always prevail over prefixes, dictating the 
place of the word accent. The reason is that prefixes are added after the low suffix -tar-": 


(38) Prefixation to nouns with the the low suffix ‘-tar-": 


bhar- Root 

bhár-tar" add dominant preaccenting -tar-" 
pra-[bhar-tar] add accented prefix 

prábhartar- erase all accents but the first 


On the other hand, -tár-" is accentually dominant, causing all accents on its base to be 
deleted, and attracting accent to itself. This shows that it is added to the entire stem 
including the prefix, causing the resulting word to be accented on the suffix. 


(39) Suffixation of high -tár-" to prefixed verbs: 


bhar- Root 

ápa-bhar add accented prefix 
[apa-bhar]-tarN add dominant accented -tár-N 
apabhartár- 


The third argument that verbal -tar-" is low and nominal -tar- is high comes from 
tmesis, the splitting of prefixes from stems. Prefixes can be separated from verbs and 
from nominals formed with low suffixes like verbal -tar-". 
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(40) a. sáttà ni yónà (= nisatta yónà) ‘a sitter down in the womb’ (RV 9.86.6) 
b. úpa sure ná dhátà (= súre nopadhata) ‘like the Placer of the Sun’ (RV 9.97.38). 


Prefixes are never separated from nominals formed with high suffixes such as nominal 
-tár-N. 

The explanation comes from the same constituent structure that accounts for the ac- 
centual difference: low suffixes such as the agent suffix -tar-" are added directly to the 
root to form a noun, which can then be composed with a prefix (see (41a)), while high 
suffixes such as the agent suffix -tár-" are added to the entire verb, which may already 
bear a prefix and/or another suffix (41b,c). 


(4) a. N b. N 
N V 
Prefix E Ew -tár-N 
c. N 
V 
V 
Prefix taps -tár N 


It will be seen the prefix is an immediate constituent of the word in (41a), but not in 
(41b) or in (41c). The natural generalization is that a prefix can only be split if it is an 
immediate constituent of the word. 

The fourth argument that verbal -tar-" is low and nominal -tár-" is high comes from 
selectional properties of prefixes. Prefixes that only combine with verb roots require 
high -tár-", because the right-branching constituent structure (41a) would require them 
to combine with nouns.” Conversely, prefixes and other elements that cannot be com- 
bined with roots, only with nouns, require the right-branching constituent structure 


(41a), which is available either with -tar-N or with -tar-".? 


26 Many examples are given in Kiparsky 2016. One will have to suffice here. The interjection him ‘the sound 
hmm’ cannot be compounded with nouns. It can only combine with the root kr ‘do’, ‘make’. The agent 
noun from him-kr- ‘to make the sound hmm’ must therefore have the high suffix -tár-N. , viz. himkartár-. 

27 Again we must make do with a couple of examples. There is no compound verb such as *para-apara-i- ‘to 
go far and near’ from which párapara-etar- ‘one who goes far and near’ might be derived. In fact párapara- 
‘far and near’ is never compounded with verbs. Instead, the agent noun is a nominal compound formed 
from para-apara- ‘far and near’ plus e-tár- ‘goer’ (—— i-tár-N). Another illustration of this generalization 
is that the negation a- combines only with nouns. From hótar 'priest' (— hu- ‘tar-”) we get á-hotar- ‘a 
non-priest’. 
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The above arguments establish the morphological constituency displayed in (37) and 
(41). But Distributed Morphology is a resourceful theory that makes available various 
movement operations that cause mismatches between morphology and syntax. So could 
the morphologically low nominalizing morphemes be spelled out high where B&V pre- 
dict they should be, and then undergo Lowering to their actual position? And conversely, 
could the high nominalizing morphemes be spelled out low as predicted, and then un- 
dergo Raising to their actual position? The answer is negative on both counts. 

The way morphologically low suffixes such as the agent suffix -tar-" could be syn- 
tactically high for purposes of the FNT is by DM’s LowzniNG operation, which applies 
before Vocabulary insertion to adjoin a head to the head of its complement (Embick & 
Noyer 2001): 


(42) Xo lowers to Yo: 
[xp Xo ...[ve -.-Yo ...]] — [xp ...[ve ---[Yo YotXo].--.]] 


In English, Lowering of T is assumed to adjoin T to v (Embick & Marantz 2008). But if 
-tar-Y is generated in the N head of (32) and is lowered from there into the v, it would, on 
B&V's assumptions, have the properties of a subject nominalizer, forming nouns from 
unaccusative and non-agentive verbs. But it does not have any properties of a subject 
nominalizer — it is a true agent nominalizer, as we showed above. 

The other thing required to maintain the FNT is to generate -tár-" low in v (as in (31)) 
and then raise it to its actual high position. This is not going to work in the DM model 
either, for morphological head-raising can only adjoin a head to the next head above it 
(Harizanov & Gribanova 2016), and in this case it would have to skip intervening heads, 
including the v head that may be occupied by causative and other V—V suffixes, which 
must not raise. Moreover, in more recent DM (Embick 2010), phonology is cyclically 
interleaved with morphology, and this would cause problems with the abovementioned 
accent erasure and tmesis phenomena. 

It should also be noted that -tar-" is overwhelmingly preferred when its special mean- 
ing and morphological restrictions allow its use, and -tár-" is used elsewhere. Moreover, 
other agent suffixes supersede each of them with particular roots and/or in particula 
special meanings. Since competition in DM obtains only between morphemes that have 
the same meaning and are realized in the same slot (such as English plural -s and -en), 
all these competing suffixes would have to be generated in the same syntactic position. 

The conclusion from the Vedic data is that the nominalizations' verbal vs. nominal 
properties do not correlate with structural height of their heads in the word or in the 
syntax. In fact, the majority of morphologically high nominalizers in Vedic have nominal 
properties, and the majority of morphologically low nominalizers have verbal properties 
- the opposite of what the FNT predicts. 


3.3 Aspect in Vedic agent nouns 


A preliminary survey of nominalizations suggests that the Aspect feature of a nominal- 
izer is the best predictor of verbal properties. Consider the following alternative to the 
FNT. 
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(43) The Aspect hypothesis 
Nominalizations assign structural case if and only if they have Aspect, either as 
an inherent feature of the nominalization, or in virtue of combining with overt 
Aspect morphology. 


By aspect features I mean outer aspect features such as imperfective and habitual, not 
inner aspect (Aktionsart), such as telicity. Sanskrit agent nouns in -tar-" are inherently 
present/imperfective and habitual. Those in -tár-", English agent nouns made with er. 
and Finnish agent nouns made with -ja have no inherent aspect: a driver can be a habitual 
or professional driver, or just someone who happens to be at the wheel on a particular 
occasion.?? 

The Aspect hypothesis is not implausible a priori because Aspect features are cross- 
linguistically known to affect case assignment - think of split ergativity based on imper- 
fect vs. perfect aspect, and accusative vs. partitive objects in Finnish depending on grad- 
ability (Kiparsky 20052). It looks promising for Vedic Sanskrit in particular because nom- 
inalizing endings with verbal properties, such as -tar-", are added to the bare root, in the 
same morphological position as the Aorist and Perfect Tense/Aspect suffixes. It is also 
consistent with the fact that Northern Paiute deverbal nominalizations, which assign 
structural accusative case, can have overt aspectual morphology below them (Toosar- 
vandani 2014: 793, fn. 6). 

The Aspect hypothesis is compatible with a lexicalist treatment of morphology. A 
Distributed Morphology analysis of Vedic agent nouns is problematic because of the 
conflicting criteria for structural height. In addition, they show a type of competition 
between morphemes that DM rejects. The semantically nondescript -tár-", structurally 
low by B&V's criteria but high in word structure, is the default (elsewhere) case. The se- 
mantically restricted -tar-" suffix, structurally high by B&V's syntactic criteria but low 
in the word morphology, is strongly preferred whenever it is applicable, namely to de- 
note habitual agency in ongoing time with morphologically simple verbs. Elsewhere the 
default is -tár-", structurally low by B&V's syntactic criteria but high in the word morph- 
ology - for past or future agency, or occasional agency, or when the verb is morpholog- 
ically complex (causative, intensive, desiderative, denominative, or prefixed. Suppose 
then that a structurally low agent is added in a syntactic derivation. The derivation must 
crash if and only if a structurally high agent can be successfully added in a competing 
derivation. But DM does not allow rules that spell out syntactico/semantic features in 
different positions to compete with each other. Moreover, if we assume bottom-up mor- 
phological spellout of the syntax (by cycles or phases), the syntactically low agent would 
have to "know" about the upstairs high agent in order to be blocked by it. On the other 
hand, in a morphological theory of word-formation, morphologically low items naturally 
block morphologically high items. Besides, blocking of affixes with general meanings by 
affixes with special meanings regardless of the locus of affixation is straighforward in 
lexicalist approaches such as those of Wunderlich (1996b; 2001) and Kiparsky (2005b). 


28 Gerunds are arguably inherently imperfective (Alexiadou 2001; Alexiadou, Soare & Iordáchioaia 2010). 
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3.4 The Finnish subject nominalizer -ja 


B&V's formulation of the FNT entails that agent nominalizations don’t assign case and 
subject nominalizations do (82). We have seen that Vedic falsifies the first of these claims. 
Finnish (among many other languages) falsifies the second. The fully productive Finnish 
suffix -ja is not an agent nominalizer, but a subject nominalizer, which is to say a high 
nominalizer in B&V's typology. It can go on non-agentive/unaccusative verbs, and freely 
attaches to causatives, often assumed to be under v, as well as denominatives, reflexives, 
inchoatives, and inner aspect morphemes such as frequentatives and semelfactives, thus 
testing positively for high position by several diagnostics. 


(44) a. kuolija ‘one who dies’, ‘dier’, eläjä one who lives’, ‘liver’, toipuja ‘one who 
gets well’, ‘convalescent’, olija ‘one who is’, osaaja ‘one who is able to’, 
syntyjá ‘one who is born’, hikoilija ‘one who sweats’, putoaja ‘one who falls’, 
turpoaja ‘one who swells’, pelkääjä ‘one who fears’, luulija ‘one who 
supposes’, tuntija ‘one who knows, expert’, muistaja one who remembers’, 
jáájà ‘one who remains’, palelija ‘one who feels cold’, tarvitsija ‘one who 
needs’, hukkuja ‘one who drowns’ 

b. i. Frequentative -ele-: kys-eli-ja ‘inquirer’, from kys-ele- ‘to make inquiries’ 
(cf. kysy-já ‘asker’, from kysy- ‘ask’). Similarly ryypiskelija ‘tippler’, 
láhentelijá ‘harasser’, myyskentelijà ‘peddler’, rehentelijà ‘bragger’, 
rütelijà ‘quarreler’. 

ii. Causative: laula-tta-ja one who makes sing’, from laula-tta- ‘to make 
sing’, cf. lauja-ja ‘singer’, from laula- ‘to sing’. 

iii. Inchoative + causative: selv-en-tá-jà ‘clarifier’, — selv-en-tá- ‘make clear, 
clarify’ («— selv-en- ‘become clear’ — selvä ‘clear’). 

iv. Causative + frequentative: sopi-ja ‘agreer’, sovi-tta-ja ‘fitter, arranger’, 
sovi-tt-el-ija ‘reconciler, negotiator’ 

v. Reflexive: puolusta-ja ‘defender’ puolusta-utu-ja ‘(self-)defender’ 


vi. Denominative: testamentt-aa-ja ‘bequeather’ (<— testamentt-at- ‘to 
bequeath’ <— testamentti ‘testament’) 


However, nominals in the suffix -ja do not assign structural case, whether they have any 
of these suffixes below them or not. Their object complement (unlike that of passive 
verbs) can only receive genitive case. 
(45) a. palkinto-j-en (“palkinno-t) saa-ja 

prize-PL-GEN (prize-PL.ACC) get-er.(NOM) 

'the/a winner of the prizes' 

b. minu-n (*minu-t) kàv-el-ytt-eli-jà-ni 
me-GEN (*me-Acc) walk-FREQ-CAUS-FREQ-er.(NOM)-1sG.POSS 
‘(the) one who frequently takes me around for walks’ 


Nominals in -ja do take oblique nominal modifiers, as do all nominalizations, including 
action nominalizations, and even to some extent ordinary basic nouns. 
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(46) a. Saksa-sta voitta-ja-na palaa-ja / pal-uu 
Germany-ELAT win-er-ESSIVE return-er.(NOM) / return-ing.(NOM) 
‘(the) one who returns / a return from Germany as a winner’ 
b. hallitse-va-ssa asema-ssa oli-ja / ol-o 
govern-in-INESS position-INEss be-er.(Now) / be-ing.(NOM) 
‘(the) one who is in a governing position’ 


c. palatsi Cannesi-ssa 
palace Cannes-INEss 


‘the/a palace in Cannes’ 
They generally do not take adverbs, except for certain perfectivizing adverbs (47d,e): 
(47) a. *nopea-sti juoksi-ja 
quick-ly run-er.(Nom) 
'(the) one who runs/ran/will run quickly’ 
b. *kilpailu-n taas voitta-ja 
competition-GEN again win-er.(NOM) 
‘(the) one who wins/won/will win the/a competition again’ 
c. *aina  matkusta-ja 
always travel-er.(NoM) 
‘(the) one who always travels’ 
d. kilpailu-sta ois jaa-ja 
P p Jaa- 
competition-ELAT away remain-er.(NOM) 
‘one who does not join the competition’, ‘eliminee’ 
e. viime-ksi tuli-ja 
lat-TRANSL come-er.(NOM) 


‘the last to arrive’ 


In (48) (an example adapted from the internet) the adverb jálleen ‘again’ appears with an 
agent noun. 


(48) Cannesi-ssa jälleen palkinno-tta jaa-ja 
Cannes-INEss again prize-ABESS remain-er.(NOM) 


‘one who ended up prizeless again in Cannes’ (lit. a remainer prizeless again...) 


Possibly it modifies not the nominalization but the abessive modifier palkinno-tta ‘prize- 
less’. 

Since Finnish -ja must be high in order to get a non-agentive interpretation and to 
scope over every kind of verb-to-verb suffix, it should assign structural case, which it 
doesn’t. So it does not fit into B&V’s syntactic typology, and constitutes a problem for 
the FNT generalization. In this case, morphological raising or lowering, even if they 
were available and motivated, would not help to resolve the contradiction. 
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The lexicalist alternative, however, holds up. Like English -er, nouns in -ja are mor- 
phologically incompatible with overt Aspect or Voice morphology, and they refer indif- 
ferently to prospective, present, or past events, hence have no inherent Aspect features. 


(49) maksaja ‘payer’ (one who has paid, is paying, or will pay), similarly ostaja ‘buyer’, 
vuokraaja ‘renter’, maahanmuuttaja ‘immigrant’, lähtijä ‘goer’, siittàjá ‘insemina- 
tor’ 


Since -ja has no Aspect features and no verbal properties, the fact that it doesn’t assign 
structural case is consistent with the Aspect hypothesis but inconsistent with the FNT. 
I conclude that Finnish -ja supports the lexicalist analysis of nominalizations. 


3.5 The Sakha agent nominalizer -AAccY 


Baker & Vinokurova (2009: 536) note that the correlation predicted by the FNT breaks 
down for Sakha agent nominalizations, which have structural accusative objects but 
otherwise conform to the low type, in that they have no Aspect morphology or adverbs. 
Their solution is that Sakha accusative case is not assigned by Voice/v but by a different 
mechanism, DEPENDENT CASE assigment. 


(50) Dependent case assignment (Marantz 1991; Baker 2015) 


a. If there are two distinct NPs in the same spellout domain such that NP; 
c-commands NP», then value the case feature of NP, as accusative unless NP, 
has already been marked for case. 


b. If there are two distinct NPs in the same spellout domain such that NP; 
c-commands NP», then value the case feature of NP; as ergative unless NP2 
has already been marked for case. 


Their main argument that Sakha accusative is dependent case is that objects of passives 
receive accusative case. This argument depends on the fragile assumption that the im- 
plicit agent of Sakha passives is a syntactically visible but phonologically null NP, which 
receives nominative case and serves as the NP, that triggers the assignment of accusative 
case to the object of the passive by (50a). 

I am skeptical of this solution for both theoretical and empirical reasons. It would 
be strange for UG to offer two entirely different methods of structural case assignment, 
since their empirical differences are rather obscure, and offer learners of most languages 
little core data to choose between them. Secondly, the analysis of impersonal passives as 
having an invisible nominative agent subject is excluded on general grounds by any kind 
of demotion analysis of passive, including the typologically grounded theory proposed 
in Kiparsky 2013. Finnish provides empirical evidence against the idea that objects of 
passives receive dependent structural case because of a syntactically visible but phono- 
logically null nominative implicit agent. The object of passives in Finnish is assigned 
structural case as in Sakha, but the case cannot possibly be assigned by the dependent 
case algorithm (50), for the implicit agent of passives in Finnish is invisible to case as- 
signment, as clearly demonstrated by Jahnsson's Rule (20b), see e.g. (22c,d). 
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Our approach predicts the transitivity of Sakha agent nouns in -aaccy out of the box. 
The reason is that they have an aspect feature. Agent nouns in Sakha -aaccy denote 
specifically habitual or generic agents. B&V (p. 531) illustrate this generalization with 
the following examples:?? 


(51) a. ynaq-y ólór-óóccü 
cow-Acc kill-AG.NoM 
‘a killer of cows, a butcher’ 
b. *Misha-ny ölör-ööccü 
Misha-Acc kill-ac.Nom 
‘the killer of Misha’ 


The habitual aspect feature of the Sakha agent nouns licenses its structural case assign- 
ment just as the habitual aspect feature of agent nouns formed by the Sanskrit agent 
suffix -tar-" does, as opposed to aspectually void agent nouns in -tár-^, Finnish -ja, and 
English -er. This accounts fully for the case data without resorting to the unsupported 
syntactic height distinctions demanded by the FNT. 

Summing up our conclusions about agent nominalizations so far: the syntactic FNT 
is falsified by Vedic agent nominalizations in one direction, and by Finnish subject nom- 
inalizations in the other, and it requires an otherwise unsupported parametric choice 
between two heterogeneous structural case assignment algorithms. The analysis reveals 
that little v can't do all ofthe following things: (1) introduce Agents, (2) host voice heads 
or agent nominalizer heads, (3) host causative V affixes, (4) host aspectual material, and 
(5) assign structural case. In agent nominals it is not possible to place nominalizing heads 
above or below little v in a consistent way that satisfies all of (1)-(5). (1) and (5) cannot be 
reconciled with an agent nominalizer that assigns structural case such as Sanskrit -tar-", 
or with a subject nominalizer that does not assign structural case such as Finnish -ja. The 
Sakha nominalizer -AAccY can dominate causatives (high) but not aspectual adverbs - 
a conflict between (3) and (4) - and introduces agents (low) but assigns structural case 
(high) - a conflict between (2) and (5). 

These difficulties fall away if we assume that that agent nominals are nouns, and that 
nouns assign structural case if and only if they have Aspect features. 


4 Conclusion 


The Functional Nominalization Thesis claims that so-called “mixed categories" arise when 
a nominal head is affixed to an extended verbal projection that is its syntactic comple- 
ment. My findings instead support a lexicalist approach, in which mixed categories are 
projections of a nominal or verbal heads with an extra phi-feature. Their extended pro- 
jections behave like normal extended projections modulo the properties enforced by that 
feature. 


29 This component is foregrounded in the related habitual participle function of the same suffix: e.g. 
salaj-aaccy means both ‘manager’ (agent noun) and ‘habitually managing’ (participle), see Vinokurova 
(2005: 123). 
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In §2 I argued that gerund phrases are not DPs/NPs with AspP complements. They 
are not even nominalizations. They are participial phrases — IPs with a Case feature 
that is checked or valued in an argument position. In all other respects their syntax is 
entirely clausal: they lack DP material such as articles, demonstratives, quantifiers, and 
adjectives, and they are formally built like IPs, complete with structural subjects. The 
lexicalist analysis explains these properties. 

In §3 I argued that agent nominalizations that assign structural case to their objects are 
not nouns with vP complements (or with any other phrasal complements), but deverbal 
nouns derived by agent suffixes that have an Aspect feature. The Aspect feature makes 
the nouns transitive, and modifiable by aspectual adverbs. Otherwise their syntax is 
entirely nominal. The merit of this analysis is that it tightly correlates the transitivity of 
agent nouns with their aspectual meaning. Also, by relieving the burden on little v, it 
eliminates the mismatches between word structure and syntax that we found in Vedic 
and Finnish agent nouns under the FNT analysis. 

The lexicalist approach retains the key idea of the FNT without the typologically un- 
warranted overgeneration caused by allowing syntactic affixation. It preserves a uniform 
mechanism of structural case assigment, a unified analysis of true nominalizations, and 
the insights that originally led Chomsky to lexicalism. 
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Chapter 16 


On mechanisms by which languages 
become [nominative-]accusative 


Ashwini Deo 
The Ohio State University 


New Indo-Aryan languages are characterized by accusative (DOM) objects in ergative, per- 
fective clauses. This paper traces the emergence of this ergative—accusative marking pattern 
with the goal of determining whether it is to be considered part of a single “de-ergativization” 
trajectory, in which languages gradually lose aspects of their ergative orientation in analogy 
to the non-ergative portion of the grammar. Data from Middle Indo-Aryan suggests that ac- 
cusative marked objects — a deviation from the classic ergatively-oriented sub-system — can- 
not be analyzed in terms of the analogical extension of any existing nominative-accusative 
model or as a reduction of markedness. In contrast, the empirical facts of Indo-Aryan di- 
achrony align better with the possibility that such deviations have to do with independent 
changes in the broader argument realization options for the language. This is consistent with 
Anderson's (1977; 2004) claim that a significant part of the explanation for ergativity-related 
patterns lies in patterns of diachronic change rather than abstract structural considerations 
of Universal Grammar. 


1 Introduction 


The term ergative is used to refer to a grammatical relation marking pattern in which 
the object of a transitive verb patterns with the single argument of an intransitive verb 
(surfacing with absolutive case), while the transitive subject patterns distinctly (surfac- 
ing with ergative case) (Dixon 1979; 1994; Comrie 1978; Plank 1979). It has sometimes 
been claimed that there is a clear asymmetry between the pervasiveness of ergative- 
absolutive vs. nominative-accusative marking systems across sub-domains of grammars 
in languages. 


No ergative language is fully consistent in carrying through the ergative principle 
throughout its entire morphology, syntax, and lexicon: all languages that exhibit 
ergative patterning in their commonest case-marking system also exhibit some 
accusative pattern somewhere in the rest of their grammar. (Moravcsik 1978, p.237) 


Ashwini Deo. 2017. On mechanisms by which languages become [nominative- 
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A possible way of interpreting this stated generalization is to take it to refer to the pres- 
ence of accusative case-marking in ergative languages - that is, that every language with 
an ergative-nominative case marking or agreement pattern also exhibits a nominative- 
accusative pattern in some subsystem of the grammar. However, this interpretation is 
clearly not borne out since several languages exist that have ergative case but lack ac- 
cusative case marking altogether! Coon & Preminger (to appear) interpret the above 
claim to mean that even in languages which show a high number of ergative character- 
istics, there can generally be found some portion of the grammar in which the ergative 
pattern is lost, and transitive and intransitive subjects are treated alike. In this case, the 
term “ergative pattern" seems to refer, not to surface morphological properties, but more 
broadly to syntactic properties like control and binding with respect to which the highest 
arguments of a clause may pattern alike. Split-ergativity is a term reserved specifically 
for morphological marking patterns and refers to the systematized occurrence of a mixed 
indexing system, which is ergatively organized in well-defined syntactic-semantic con- 
figurations with nominative-accusative marking elsewhere in the language. The ques- 
tion of how such systems arise in natural languages and change (or persist) through time, 
as well as the possible diachronic reasons for the parameters on which the split is based, 
can only be answered by an investigation of split-ergative languages for which we have 
some clear diachronic record available. 

Anderson (1977, and later in 2004)has suggested that to the extent we have such infor- 
mation, changes involving ergative orientation seem to be “consequences of relatively 
superficial phenomena.” According to him, ergative patterning is not a deep syntactic 
property of linguistic systems but rather an emergent effect arising from several distinct 
trajectories in the morphological systems of languages. In effect, there is no principle 
that determines an "ergative" or "accusative" pattern; rather languages may innovate or 
lose specific cases such as ergative or accusative, with such patterns arising more as emer- 
gent effects of the change and not as abstractly determined invariant objects. This paper 
examines one such emergent effect in trajectories associated with systems containing 
ergative case — the emergence of overt accusative (object) marking in ergative clauses. 
New data from Late Middle Indo-Aryan (MIA) and Early New Indo-Aryan (NIA) suggests 
that transitions resulting in deviations from the classic ergatively-oriented sub-system 
in a split ergative language cannot be analyzed uniformly in terms of the analogical ex- 
tension of any existing nominative-accusative model or as a reduction of markedness. 
In contrast, the empirical facts of Indo-Aryan diachrony align better with the possibil- 
ity that such deviations have to do with independent changes in the broader argument 
realization options for the language. This is consistent with Anderson's claim that a 
significant part of the explanation for ergativity-related patterns lies in patterns of di- 
achronic change rather than abstract structural considerations of Universal Grammar 
(contra Delancey 1981; Dixon 1994; Tsunoda 1981). 


! An anonymous reviewer points to languages like Chukchi, Tabassaran, Chamalal, Tzutujil, Central Yupik 
Eskimo, and Burushaski that lack an accusative case, and therefore lack nominative-accusative “patterning” 
in terms of case marking. 
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2 Morphosyntactic changes in Middle Indo-Aryan 


2.1 The emergence of ergativity 


One well-discussed source for ergative marking in natural languages is a passive clausal 
structure that gets reanalyzed as active. Oblique marking on the optionally surfacing 
agent is reanalyzed as ergative case while the unmarked subject of the passive clause 
surfaces as absolutive object, identical to the subjects of intransitive clauses. Indo-Aryan 
languages bear the most concrete diachronic record for such a passive-to-ergative shift 
scenario. In the history ofthese languages, a passive construction with resultative seman- 
tics was reanalyzed as an active, ergative clause with perfective aspectual reference at 
least by the time of Epic Sanskrit (Old Indo-Aryan (OIA)) and Early MIA (Andersen 1986; 
Peterson 1998; Condoravdi & Deo 2014 a.o.)? In the oldest Vedic texts, the -ta-affixed 
form of the verb serves to describe a result-state brought about by a preceding event 
when it is used predicatively in an adjectival passive construction. The -ta forms (bold- 
faced) in (1a) agree with the nominative patient while the agent remains unexpressed. In 
(1b), the agents and instruments are overtly expressed in the instrumental case. 


(1 a. stir-nam te barhih su-ta 
Strew-PERF.N.SG you.DAT.sc Barhis.NOM.N.SG press-PERF.M.SG 
indra sóma-h kr-tá dhàná át-tave 
Indra.voc.sc Soma-NOM.M.SG do-PERF.M.PL barley.NOM.M.PL eat-INF 
te hari-bhyam 
yOU.GEN.SG horse-DAT.sG 
"Ihe Barhis has been strewn for thee, O Indra; the Soma has been pressed 
(into an extract). The barley grains have been prepared for thy two 
bay-horses to eat. (Rgveda 3.35.7) 

b. nr-bhir dhü-táh su-tó asna-ih av-yo 
man-INST.PL wash-PERF.M.SG press-PERF.M.SG stone-INST.PL wool-GEN.SG 
vára-ih páripü-tah 
filter-INST.PL strain-PERF.M.SG 
‘It (the Soma) has been washed by men, pressed with the help of stones, 
strained with wool-filters’ (Rgveda 8.2.2) 


As shown in (2), the -ta form agrees with the sole (nominative) argument of intransi- 
tive verbs. This results in a difference in the marking of the subject arguments of transi- 
tive and intransitive verbs. In (1) the verb does not agree with the instrumental agentive 
arguments. In (2), in contrast, the verb sri-tah has a nominative subject soma and agrees 
with it in number and gender. 


? The Indo-Aryan branch of Indo-European inherits the deverbal result stative form with the affix -ta (al- 
lomorph -na) (reconstructed for Indo-European as *-to/-no). -ta, attested at all stages of OIA and MIA, 
attaches directly to the root, and the resulting stem is adjectival, inflecting for number and gender like any 
other adjectival forms. 
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(2) div-i somo adhi $ri-tah 
heaven-LOC.sG soma.NOM.M.SG On  rest-PERF.M.SG 


‘Soma rests (is supported) in the heaven’ (Rgveda 10.85.1) 


This resultative -ta construction (sometimes in periphrasis with tense auxiliaries) is 
the source of the ergative pattern observed in the perfective aspect in the later languages. 
In later stages of OIA, the construction was extended to marking the perfect aspect and 
it exhibited existential as well as universal perfect readings (Condoravdi & Deo 2014). 
By the time of Epic Sanskrit (late stage of OIA), the -ta construction became a frequently 
used device for marking past perfective reference. The agent argument in these cases 
is most frequently overt and marked with instrumental case. Past eventive reference is 
indicated by the presence of past referring frame adverbials like pura ‘formerly’ and tada 
‘then’. Perfective clauses containing intransitive verbs occur with nominative subjects 
(3c). All the examples below are from the Mahabharata, one of two epics that constitute 
the record for this stage of the language. 


(3) a. pura devayug-e ca eva drs-tam sarvam maya 
formerly god.age-Loc.sc and PTCL see-PERF.N.SG everything I-INsT.sG 
vibho 
lord-voc.sc 


‘Lord, formerly, in the age of the Deva (Gods), I saw everything’ 
(Mahabharata 3.92.6a; Deo 2012) 

b. hr-ta gau-h sa tada t-ena 
steal-PERF.F.SG COW-NOM.F.SG that-NOM.r.sc then he-INsT.3.sG 
prapata-s tu na tark-itah 
fall-NOM.M.SG PTCL NEG consider-PERF.M.SG 


‘Then he stole that cow, but did not consider the fall (consequences): 
(Mahabharata 1.93.27e; Deo 2012) 

c. jaratkaruh ga-tah svarga-m sahitah 
Jaratkaru.NOM.M.SG go-PERF.M.sG heaven-Acc.sc accompanied 
sva-ih pitamaha-ih 
self-INST.M.PL ancestor-INST.M.PL 


‘Jaratkaru went to heaven accompanied by his ancestors. (Mahabharata 
1.130.43c) 


The main change between Epic Sanskrit (OIA) and the later MIA stage of the language 
concerns the erosion and simplification of the rich tense-aspect system (Pischel 1900; 
Bloch 1965). Inflectional past referring forms such as the aorist, the inflectional perfect, 
and the imperfect disappeared from the language, leaving the -ta construction as the 
only past referring device.? This loss of the inflectional system has often been cited as a 
reason for the increase in the frequency and scope of the participial construction, which 


5 Traditional grammarians do provide instances of the inflectional perfect and the aorist during this period, 
but they only occur as isolated, unanalyzed forms for a few verbs like aha-'say-AoR' and akashi -‘do-aor’. 
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in turn led to the unmarking of the stative nature of the construction. The change to 
an ergative alignment was certainly complete at the Mid to Late MIA stage (Hock 1986; 
Bubenik 1998). The examples below from an archaic MIA Maharastri text Vasudevahimdi 
(ca. 500 AD) shows this ergative alignment. The verb agrees with the nominative subject 
in (4a). In (4b) the verb agrees with the nominative marked object while the agentive 
argument (‘that running one’) appears in the instrumental. 


(4) a. pat-to ya seniyo ràyà ta-m 
reach-PERr.M.sc and Seniya.NoM.M.sc king.NoM.M.sc that-Acc.sc 
paesa-m 
place-Acc.sc 
"And King Seniya reached that place? (Vasudevahimdi KH. 17.1) 

b. t-ena palayaman-ena puranakuv-o 
that-INsT.sc running-INsT.sG old-well-NoM.M.sG 
tanadabbhaparichinn-o dit-tho 
grass.covered-NOM.M.SG notice-PERF.M.SG 


‘That running one noticed an old well covered with grass’ (Vasudevahimdi 
KH. 8.6) 


Indo-Aryan diachrony after the MIA stage has often been characterized as involving a 
progressive loss of ergative alignment and gradual drift towards a nominative-accusative 
marking in perfective clauses. There are three observed ways in which the descendent 
systems deviate from the proto-ergative system of MIA: (a) Loss of ergative morphology 
in pronominal and nominal paradigms’; (b) Subject agreement (replacing or in addition 
to object agreement); (c) Accusative marking on a privileged class of objects, ie. the 
spread of differential object marking. 

It is logical to think of the implementation of any of these changes independently or 
together as the “de-ergativization” of an ergative system in analogy to the non-ergative 
portion of the grammar. Indeed, the patterns seen in individual NIA languages, such as 
suppression of overt ergative case (e.g. in Old Hindi and Marathi); nominative subjects 
(e.g. in Bangla) and agreement with overt ergative subject (e.g. in Nepali) are all analo- 
gizable to existing marking patterns in the language such as unmarked subjects, nomina- 
tive subjects, and subject agreement. However, the emergence of accusative marking on 
objects of transitive, perfective clauses poses a puzzle for a straightforward analogical 


^ In fact, data from some Early NIA languages, e.g. Hindi, reveals that the original instrumental marking 
observed on transitive subjects for the MIA ergative system is entirely lost for all nominal and pronominal 
expressions in some stages of Indo-Aryan. The ergative pattern of agreement is nevertheless retained. The 
example in (i) is from the work of Kabir, a poet from the 15th century CE. There is no overt ergative marking 
on the 3rd person subject but the agreement on the verb is with the feminine object argument (explicit or 
unpronounced) chadar ‘sheet’. 


() jo chadar sura-nara-muni odh-i 
which sheet.NoM.F.sG gods-men-sages. ÜERG Wrap-PERF.F.SG 


"Which sheet the Gods, men, and sages, all wore, (that sheet)... 
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extension narrative for de-ergativization. The puzzle arises from the evolution of case 
marking in MIA, to which we now turn. 


2.2 Syncretism in nominal case marking 


A critical change between the OIA and MIA stages, particularly in the Late MIA period, is 
the restructuring of the nominal case system. Notable here is the loss of morphological 
contrast between nominative and accusative as well as between the genitive and the 
dative cases. The syncretized set of case-endings for full nouns are given in Table 1. 


Table 1: Case-endings for full nouns. 


Singular Plural 
Nominative/Accusative  -u, a, am -a, ai 
Instrumental/Ergative -em, im, he, hi -e(h)i, ehi, ahi 
Ablative -hu, ahu, aho -hü, ahi 
Genitive/Dative -ho, aho, ha, su, ssu  -na, ha 
Locative -i, hi, him -hi 


Table 2 contains an example of inflected -a stems with the noun putta ‘son’. 


Table 2: Inflected a-stems with putta ‘son’. 


Stem Case Singular Plural 
a-stems Nominative/Accusative putt-u putt-a 
Instrumental/Ergative ^ putt-em putta-him/ehim 
Genitive/Dative putt-aho/ahu ` putta-ham 


The pronominal system retains more contrasts and syncretism between the nomina- 
tive and accusative is observed only in the plural sub-part of most pronominal paradigms. 
Table 3 (culled from Clercq 2010) provides inflectional forms for some pronominal expres- 
sions to illustrate. 

The loss of contrast between the nominative and accusative cases in most paradigms 
in a relatively free-word order language leads to heavy reliance on semantic cues from 
the linguistic material to determine grammatical relations. Consider the following ex- 
amples from the Paumacariu, an 8th century text in verse, to illustrate the syncretic 
nominative-accusative marking (glossed Now)? In (5), a sequence of parallel clauses, 


5 This is a Jaina rendition of the Epic Sanskrit text Ramayana. The edition used is the H.C. Bhayani edi- 
tion published by the Bharatiya Vidya Bhavan between 1953 and 1960. The text is available in searchable 
electronic format, input by Eva De Clercq at Ghent University. The reason for using a late MIA text is to 
identify properties of the system that is as close to the grammars of the Early NIA system as possible. 
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Table 3: Inflectional forms for pronominals. 


Stem Case Singular Plural 

1st pronoun Nominative haum amhé, amhaim 
Accusative mai(m) amhé, amhaim 
Genitive/Dative mahu, majjhu amha, amhaha 

2nd pronoun Nominative tuhum tumhé 
Accusative paim, taim tumhé 
Genitive/Dative tahu, tujjha tumha, tumhaha 

3rd pronoun. Nominative SO, SU; Sã te, tau 

MASC;FEM Accusative tam; sa te; tau 


Genitive/Dative taho, tahu; tahe taham; taham 


whether the first-occurring nominative expression realizes the grammatical subject or 
the grammatical object is determined by the meaning of the cause P In (6), the relative 
pronoun, which refers to a human participant, disambiguates the grammatical structure. 


(5) 


#kim tamu han-ai na vàlu ravi# 

QUES darkness.NoM.sc destroy-IMPF.3.SG NEG young sun.NOM.SG 

#kim valu davaggi na dah-ai vanus 

QUES young fire.NOM.sG NEG burn-IMPF.3.sc forest.NOM.sG 

#kim kari dal-ai na valu hari# 

QUES elephant.NoM.sc shatter-IMPF.3.sG NEG young lion.NoM.sc 

#kim valu na daik-ai uragamanu# 

QUES young NEG bite-1MPr.3.sc snake.NOM.sG 

"Does the young (rising) sun not destroy darkness? Does the young fire (spark) 
not burn down the forest? Does a young lion (cub) not shatter the elephant? 
Does the young snake not bite?' (Paumacariu 2.21.6.9) 


jo ghan nisi-bhoyanu ummah-ai 
who.REL.NOM.M.SG PTCL night.Loc-meal.NOM.M.SG give.up-IMPF.3.SG 


vimalattanu vimala-gottu lah-ai 
spotless.body.NoM.M.sc spotless.name.NOM.M.SG attain-IMPF.3.sG 


‘One who gives up eating in the evening (he) attains a spotless body and name’ 
(Paumacariu 2.34.8.8) 


Accusative marking is clearly visible only on first and second person singular pro- 
nouns in imperfective clauses as shown in the examples in (7). 


5 The #...4 marks clause boundaries in the sequence. 
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(7) a. suggiu deva paim sambhar-ai 
Suggiu.NoM.M.sG deva.NOM.M.SG you.ACC.SG remember-IMPF.3.SG 


‘Lord Suggiu remembers you. (Paumacariu 3.45.10.8) 
b. jaina vihana-e paim vandhav-ami 
if NEG tomorrow.Loc.sc you.Acc.sc bind-1MPr.1.sc 
‘If I do not capture you tomorrow..? (Paumacariu 3.49.20.3) 
c. jo maim  muevi annu 
who.REL.NOM.M.SG L.Acc.sc besides another.NoM.M.sc 
jayakar-ai 
adore-IMPF.3.SG 


‘(The one) who adores another one besides me...” (Paumacariu 2.25.1.9) 


Syncretism rooted in sound change is also observed between the nominative and in- 
strumental forms (the case form that gets re-interpreted as ergative when appearing with 
agentive arguments in perfective clauses) of the first and second person plural pronouns 
as in Table 4. 


Table 4: Nominative and instrumental pronominal forms. 


Aspect Person Number 
Singular Plural 

Non-perf 1 haum amhai/amhé 
Perf 1 maim amhai/amhé/amhe-him 
Non-perf 2 tuhum tumhai/tumhé 
Perf 2 taim tumhai/tumhé/tumhehim 
Non-perf 3 so te 
Perf 3 tem, tené  tehi/táham 


Despite this syncretism, agreement is uniformly with the nominative argument - with 
the nominative object in constructions based on the -ta form and with the nominative 
subject elsewhere. The examples in (8) illustrate this pattern with the first and second 
person plural pronouns amhé and tumhé. (8a) contains the syncretized pronoun amhé 
which triggers agreement in the imperfective aspect while the same form fails to trigger 
agreement in (8b). In (8c) the second person plural syncretic form used in an imperative 
clause triggers agreement while it fails to trigger verb agreement in the perfective (8d). 


(8 a amhé  jàe-va vanavasa-ho 
We.SYNCR g0-IMPF.1.PL forest.dwelling-DAT.sG 


"We are going to our forest-exile? (Paumacariu 2.23.14.3) 
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b. ki-u amhé ko  avarah-o 
do-PERF.M.sG we.SYNCR what crime-NOM.M.SG 


‘What crime have we done?’ (Paumacariu 1.2.13.9) 


c. jiha sakk-aho tiha utthar-aho  tumhé ‘Save 
in.which.way can-IMP.2.PL in.that.way save-IMP.2.PL you.SYNCR.PL 
yourselves in the way that you can’ (Paumacariu 5.82.12.4) 


d. tumhé jam cint-iu tam 
you.SYNCR.SG what.REL.M.sc think-PERr.M.sc that.CORREL.M.sG 
hü-a 
happen-PERF.M.SG 


‘That, which you thought (would happen), happened. (Paumacariu 3.47.9.6) 


These patterns of syncretization within the nominal inflectional system of MIA are dif- 
ficult to reconcile with a story in which there is a straightforward extension of an existing 
alignment pattern in the language to a marked sub-system of the grammar. Although 
there is a contrast between the nominative and accusative cases in MIA, it is exhibited 
only in selected parts of the pronominal system (a subset of the singular pronouns) and 
therefore seems to be rather weak evidence for extending the accusative marking pattern 
to ergative clauses. A reviewer argues that the regular presence of such a case-marking 
pattern in imperfective clauses, however limited in terms of its application, should not 
be seen as “weak” evidence for a nominative accusative pattern. I concede that it is in- 
deed theoretically possible that the pattern observed in a small subset of imperfective 
non-ergative clauses gets extended to perfective, ergative clauses. However, neither ex- 
isting grammars of MIA (Pischel 1900; Vale 1948; Clercq 2010) nor an examination of the 
textual data indicate any presence of accusative marked object arguments in perfective 
transitive clauses at this stage in the language. Even pronominal objects (9a)-(9b) and 
human-denoting full noun phrase objects (9c)-(9d) of canonical transitive verbs, which 
obligatorily appear with overt accusative marking in the NIA languages, are uniformly 
marked nominative at this stage. 


(9 a. haum  nikkàrane ghall-iya ram-em 
I.NoM.sG without.reason drive.out-PERF.F.SG Ram-ERG.SG 


‘Ram drove me out (of Ayodhya) without any reason? (Paumacariu 5.81.13.8) 


b. cakkesar-ena kema tuhü di-tthi 
Cakkesara-ERG.M.sc how you.NOM.SG see-PERF.F.SG 


‘How were you noticed by Cakkesara (Ravana)?' (Paumacariu 2.4.2.1.5) 
c. vinivar-iu ravanu rahav-ena 
dissuade-PERF.M.SG ravana.NOM.M.SG rahava-ERG.M.SG 


‘Rāhava (Rama) dissuaded Ravana’ (Paumacariu 4.66.14.6) 


7 Thus, there are no positive instances with pronominal forms maim, taim, tam etc. being used instead of 
haum, tuhum, or so/su etc. in ergative clauses with pronominal objects at even the latest stages of Middle 
Indo-Aryan. 
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d. di-tthu janaddanu rahavacand-em 
see-PERF.M.SG janaddana.NOM.M.SG rahavacanda-INs.sG 


‘Rahavacanda saw Janaddana. (Paumacariu 2.29.8.1) 


Moreover, no language of the later stage (Early NIA) has an ergative-accusative mark- 
ing pattern which uses the pronominal forms of late MIA in ergative clauses that ac- 
cusative marking on objects. While the issue needs to be more closely investigated, it 
seems reasonable to look for an alternative source for accusative marking in ergative 
clauses than the template offered by MIA. 


3 Differential object marking: A New Indo-Aryan 
innovation 


The previous subsection established that accusative marking of the MIA variety is both 
weakly present and shows no evidence of being extended to perfective ergative clauses 
at later stages of Indo-Aryan. This leaves the possibility that the incidence of object 
marking in ergative clauses - a pervasive phenomenon in the Modern NIA languages - 
begins with the Differential Object Marking pattern - which is considered to be an NIA 
innovation. Differential Object Marking (henceforth DOM) in Indo-Aryan languages is 
sensitive to animacy and referentiality features of arguments. It is obligatory on 1st and 
2nd pronominal objects, and on 3rd person animate-denoting pronominals. It is optional 
with animate-denoting full NPs where the absence of object marking correlates with a 
non-referential interpretation of the NP. In the Modern NIA languages, this semantically 
driven pattern of object marking does not distinguish between ergative and non-ergative 
clauses; i.e. the case marking on objects is entirely independent of any overt or covert 
presence of case on the subject. 

Logically, one can imagine two ways in which an ergativity-insensitive object mark- 
ing pattern can emerge in a system. It could be that the DOM pattern first emerges in 
Late MIA or Early NIA in non-ergative clauses. Such a pattern is then later extended 
analogically to ergative clauses as part of the de-ergativization trajectory characteriz- 
ing Indo-Aryan diachrony. The second possibility is for the DOM pattern to emerge 
simultaneously in both ergative and non-ergative clauses and gradually extend to dif- 
ferent classes of verbs. On this latter scenario, the presence of DOM in ergative clauses 
is not part of the larger de-ergativization trajectory that characterizes NIA diachrony, 
but rather attributable to independent developments that introduce overt marking on 
direct objects into the case system.? The empirical facts of Late MIA and Early NIA texts 
support the second scenario. In what follows, I will suggest that the emergence of DOM 
in both ergative and non-ergative clause types of MIA amounts to the extension of an 
inherited OIA marking pattern observed with the class of so-called “double object" verbs. 


8 The effects on agreement in languages which exhibit such a changed case-marking pattern may be different. 
In Modern NIA we see both default agreement in ergative clauses when both subject and object are case- 
marked (e.g. in Hindi, Marathi) or continued object agreement despite overt accusative marking on the 
object (e.g. in Gujarati, Marwari). 
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3.1 Double object verbs in Old Indo-Aryan 


A class of verbs in OIA exhibits a double object pattern in which the theme or goal and 
another participant of the denoted event are marked in the accusative case. Semantically, 
this is a diverse class and includes at least the subclasses in Table 5. 


Table 5: Double object verbs in Old Indo-Aryan. 


Class Verbs 
Verbs of speaking bra ‘speak’, vac ‘say’, kath ‘tell’ 


Verbs of asking prech ‘ask’, yac ‘request, solicit’, bhiks ‘beg’, prarth ‘plead’ 
Verbs of teaching upa-dis ‘teach’, anu-sas” ‘teach’, a-dis ‘direct’ 


Causatives of some  khdd-aya ‘cause to eat’, pà-yaya ‘cause to drink’, dars-aya 


transitives ‘cause to see’, śrāv-aya ‘cause to hear’ 
Miscellaneous ji ‘win, duh ‘milk’, dand ‘punish’, ni ‘lead’ 
ditransitives 


(10) contains examples from OIA (Epic Sanskrit) involving verbs of speaking in imper- 
fective, non-ergative clauses. In (10a), the pronominal denoting the addressee if the verb 
of speaking event tvam is accusative as is the information communicated, nidarsanam 
‘the teaching’. (10b), from a proximal location in the text, is similar. 


(10 a. atas tvà-m kathay-e karna nidar$an-am 
hence you-Acc.sc tell-1MPr.1.sc karna.voc.sc teaching-Acc.N.sG 


idam punah 
this.Acc.N.sG again 


"Hence, O Karna, I tell you this teaching (advice) again? (Mahabharata 
8.28.8e) 

b. $alyo *brav-it punah karn-am 
Salya.NOM.M.SG speak-IMPFCT.3.sG again karna-ACC.M.sG 


nidarsan-am udahar-an 
teaching-ACC.N.SG announce-PART.NOM.M.SG 


‘Salya again spoke out his advice to Karna’ (Mahabharata 8.28.1c) 


An alternative realization for pronominal animate-denoting higher arguments of dou- 
ble object verbs is as DAT/GEN clitics. 


(11) a. hanta te kathay-isy-àmi nam-ani iha 
PTCL you.DAT/GEN.CL tell-FUT-1.sG ^ name-ACC.N.PL here 
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manisi-nam 
wise-one-GEN.M.PL 
‘Ah, I will tell you the names of the wise ones. (Mahabharata 1.48.4a) 
b. (Gate bhagavan ekah saty-am 
reign-IMPF.3.sc Lord.NOM.M.SG alone.NOM.M.SG truth.ACC.N.sG 
etad brav-imi te 
this.ACC.N.sG speak-IMPr.1.sG you.DAT/GEN.CL 


"Ihe Lord alone reigns [over time and death and this universe of mobile and 
immobile objects], this truth I tell you? (Mahabharata 5.66.13c)? 


In ergative, perfective clauses, this higher argument may surface variably: either as 
the nominative subject of the passivized verb form (examples in (12)) or as a DAT/GEN 
marked clitic pronoun (examples in (13)).!° 


(12) a. uk-to ratr-au mrg-air as-mi 
speak-PERF.M.SG night-Loc.sc animal-INST.PL be-IMPF.1.sG 
‘I was spoken to by the beasts at night! (Mahabharata 3.244.11a) 
b. ta-yà..  é$r-àv-ito vacan-àni sah 
she-INs.sG hear-CAUS-PERF.M.SG word-Acc.N.sc he.NoM.sc 
‘He was made to hear (these) words by her? (Mahabharata 2.2.62) 
c. sa maya varadah kam-am 
he.NOM.M.SG Lins.sc boon.granting.NOM.M.sG desire-ACC.M.SG 
yac-ito dharmasamhit-am 
solicit-PERF.M.SG virtue.bound-Acc.M.sG 


‘He, the boon-granting one, was solicited by me for (fulfilling my) virtuous 
desire? (Mahabharata 1.78.3c) 


(13) a. samkhyadarsan-am etavad uk-tam te 
samkhyadarsan-NoM.N.sG so far speak-PERF.N.sG you.DAT/GEN.SG 
nrpasattama 
best.king.voc.sc 
‘Thus far, the SamkhyadarSana was spoken to you, O best of kings: 
(Mahabharata 12.295.1a) 

b. tad etat kath-itam ^ sarv-am mayà vo 
thus, this.NOM.N.SG tell-PERF.N.SG all-NOM.N.SG LINS.SG you.DAT/GEN.PL 
munisattamah 
great.sage.VOC.PL 


‘Thus, I have told you all this, O great sages? (Mahabharata 1.20.12a) 


? The previous line of verse completes the translation: kálasya ca hi mrtyos ca jangamasthavarasya ca (Ma- 
habharata 5.66.13) 

10 In (12a), the passivized subject is covert and the nominative case marking of the pro-dropped subject is 
inferred from the agreement on the auxiliary verb asmi. 
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c. upadis-to hi me pitr-à yogo 
teach-PERF.M.SG PTCL I.DAT/GEN.CL father-1INsT.3.sG method.NOM.M.SG 
*nika-sya bhedan-e 
array-GEN.M.SG penetration-LOC.N.SG 
"Ihe method of penetrating into this (military) array has been taught to me 
by my father? (Mahabharata 7.34.19a) 


d. brahmacary-am idam bhadr-e mama 
celibacy-NoM.N.sG this good.lady-voc.sc I.GEN.sG 
dvadaSavarsik-am dharmaraj-ena ca adis-tam 
twelve.years-NoM.N.sc Dharmaraja-INs.sc and command-PERF.N.SG 


‘Good lady, this twelve-year celibacy has been commanded of me by 
Dharmaraja’ (Mahabharata 1.206.21a-c) 


The argument realization pattern illustrated in (11) and (13), where the higher argu- 
ment of a double object verb surfaces with dative or genitive marking in both ergative 
and non-ergative clauses, is fairly robust in OIA. The alterations to the nominal case sys- 
tem in MIA described in Section 2.2, have no effect on this pattern, since the syncretized 
DAT/GEN remains available for overt marking throughout the period. Crucially, given 
the organization of the MIA case system, this dative/genitive marking is the only reli- 
ably present overt marking on non-subject arguments in both ergative and non-ergative 
clauses at this later stage. Based on the data from MIA, it seems most reasonable to con- 
jecture that this template triggers the reanalysis of DAT/GEN as accusative marking on a 
subset of direct objects. 


3.2 Double object verbs in Middle Indo-Aryan 


In (14) are given examples of the OIA double object verbs in their MIA incarnations. 
Notice that themes surface with the syncretized nominative-accusative case (glossed 
NOM) while the non-theme higher argument (the addressee of the speech verb in (14a)- 
(14b) and the causee in (14c)) appear with the syncretized DAT/GEN marking.” 


(14) a. sabbhav-em ` rama-ho kah-ai ema 
goodwill-INs.sc rama-DatT/GEN.SG tell-IMPr.3.sc this.NOM.N.sG 
‘He said this to Rama with goodwill’ (Paumacariu 2.40.13.7) 


b. marui kah-ai vatta valadeva-ho 
Marui.NoM.sG tell-ImpF.3.sG news.NOM.SG valadeva-DAT/GEN.SG 


"Maruti told the news to Valadeva’ (Paumacariu 3.55.9.1) 
c. ta-ho daris-àv-ami ajju jamattanu 
he-DAT/GEN.SG see-CAUS.IMPF.1.SG now yama.prowess.NOM.N.SG 


‘Now, I will show him the prowess of Yama (the god of death). (Paumacariu 
1.11.10.6) 


H (14a) and (14c) have subject pro-drop. 
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A look at perfective, ergative clauses in MIA containing double object verbs reveals 
overt DAT/GEN marking on the non-theme argument and unmarked themes. (15a) con- 
tains the causative of a perception verb, while (15b)-(15c) contain verbs of speaking. Just 
like OIA, there is no difference between ergative and non-ergative clauses vis-a-vis the 
realization of non-subject arguments. 


(15) a. pad-e padima... siya-he... daris-àv-iya 
Screen-LOC.sG image.NOM.F.SG Sita-DAT/GEN.SG See-CAUS-PERF.F.SG 
bhàmandala-ho 
Bhamandala-DAT/GEN.SG 
‘(He) showed the image of Sita on a screen (painting) to Bhamandala’ 
(Paumacariu 2.21.8.9) 

b. kah-iu asi ma-hu parama-jinind-em 
tell-PERF.M.SG be.PsT.3.sc LDAT/GEN.sG great-Jinendra-ERG.M.sG 
"Ihe great Jinendra told (this) to me? (Paumacariu 1.1.12.8) 


c. ta-ho maim parama-bheu ehu 
yOu.DAT/GEN.SG LERG.sG great.secret.NOM.SG this.NOM.sG 
akkh-iya 


tell-PERF.N.sG 


‘T have told you this great secret? (Paumacariu 1.16.8.9) 


In addition to the non-theme arguments of double object verbs, the syncretized poar! 
GEN marking also appears on possessor and goal arguments of standard ditransitives 
(examples in (16)) and on themes of verbs that describe a reciprocal experience (examples 
in (17)). 

(16) a. kikkindha-ho ghall-iya mala tae 
kikkindha-DAT/GEN.sG put-PERF.F.sc garland.r.sc she.ERG.SG 
‘She garlanded Kikkindha (lit. put a garland on)’ (Paumacariu 1.7.4.1) 

b. paripes-iu lehu pahana-ho anaranna-ho 
send-PERF.M.SG letter.NOM.M.SG chief-DAT/GEN.sG Anaranya-DAT/GEN.SG 
ujjha-he rana-ho 
Ayodhya-pat/GEN.sG king-DAT/GEN.SG 
‘(He) sent a letter to Anaranya, the king of Ayodhya’ (Paumacariu 1.15.8.4) 


c. angutthala nav-evi samapp-iu tavahn mahu 
finger.ring.NOM.M.sG bow-GER hand-PERF.M.sc then — LDAT/GEN.sG 
cüdàmani app-iu 
precious.gem.NOM.M.SG give-PERF.M.SG 


(After) I handed her the finger ring, having bowed to her, (she) gave me this 
precious gem. (Paumacariu 3.55.9.7) 
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d. dinna kanna maim dasaraha-tanay-aho 
give-PERF.F.sG daughter.NoM.r.sc Lrnc.sc dasaraha-son-DAT/GEN.SG 


‘I have given my daughter to the son of Dasaraha (Dagaratha)? (Paumacariu 
2.21114) 


(17) a. salil-u samudd-aho jiha milai 
water-NOM.SG OCean-DAT/GEN.SG as  meet-IMPF.3.SG 


‘Just as the water meets the ocean’ (Paumacariu 3.56.1.12) 

b. tàvehfi gayana-ho oar-evi afijana-he vasantamala 
then, sky-ABL descend-GER Afijana-DAT/GEN.SG Vasantamala.NOM 
mil-iya 
meet-PERF.F.SG 


‘Then, having descended from the sky, Vasantamala met Afijana’ 
(Paumacariu 1.19.8.10) 


Critically, the syncretized DAT/GEN marking is the only reliable signal of non-subject 
arguments in MIA and it appears without discernible difference in distribution in both 
ergative and non-ergative clauses. It does not however appear, for the most part, on 
theme/patient arguments of canonical transitive or ditransitive verbs — animate or other- 
wise. (18a)—(18b) are examples of ergative clauses with animate-denoting subjects while 
(18c) contains a non-ergative clause. 


(18) a. ha vahue-vahue map bhantiy-ae tuhü 
alas bride.voc ` LERG.sc unthinking-INST.sSG you.NOM.sG 
ghall-iya aparikkhantiy-ae 
drive.out-PERF.F.sG without.testing-ERG.F.SG 
"Alas, O bride, I drove you out without testing you in any way: (Paumacariu 
1.19.15.7) 

b. ni-u tihuana-paramesaru tettahe sapparivaru 
take-PERF.M.sG three.worlds.lord.NoM.M.sc there ` with.family.NOM.M.sc 
purandaru jettahe 
purandara.NOM.M.sc where 
‘(She) took the lord of the three worlds there where Purandara was with his 
family? (Paumacariu 1.2.2.8) 

c. munivara ghall-es-ai rajjesar-u 
sage.NOM.M.PL drive.out-FUT-3.sc king-NOM.sG 


"Ihe king will drive out the sages’ (Paumacariu 2.35.9.1) 


3.3 The emergence of DOM 


The key suggestion I make here is that the Indo-Aryan differential object marking pat- 
tern emerging between late MIA and Early NIA amounts to the generalizing reanalysis of 
syncretic DAT/GEN marking on non-subject non-theme arguments as accusative marking 
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on (a privileged class of) objects. The data that provide evidence to enable such a reanal- 
ysis are clauses containing double object and other ditransitive verbs which either have 
implicit (non-overt) theme arguments or where the arguments (in the case of verbs of 
speech) are propositional. Such clauses are not very frequent but they do occur quite 
reliably in MIA. Examples of non-ergative clauses are given in (19) and ergative clauses 
are in (20). 


(19) a akkh-ai siya samirana-putt-aho 
tell-IMPr.3.sc Sita.NoM.sc Samiranaputta-DAT/GEN.SG 
‘Sita told Samirana-putta (this)? (Paumacariu 3.50.10.7) 
b. kahai maharisi gayana-gai 
Say-IMPF.3.sG great.sage.NOM.M.SG sky.traveling.NOM.M.SG 
taho lavan-aho samar-e samatth-aho 
that.DAT/GEN.sG Lavana-DAT/GEN.SG battle-Loc.M.sc capable-DAT/GEN.SG 


‘The great sage said to that Lavana, who was capable in battle (thus). 
(Paumacariu 5.82.8.9) 


(20) a. atthavaya-giri-kampavan-aho padihàr-em akkh-iu 
eight.regions.trembling-DAT/GEN.sG messenger-ERG.SG tell-PERF.M.SG 
ravan-aho 


ravana-DAT/GEN.SG 
‘The messenger told (this) to Ravana, who was capable of causing the eight 
territories (astapada) to tremble’ (Paumacariu 1.15.4.1) 

b. to paminipura-paramesar-aho daris-àv-iya 
then paminipura-lord-DAT/GEN.sG see-CAUS-PERF.M.PL 
vijaya-mahihar-aho 
vijaya-king-DAT/GEN.SG 
‘They showed (the boys) to the lord of Paminipura (Padminipura), the king 
Vijayaparvata. (Paumacariu 2.33.2.1) 

c. afijan-ahe samapp-iu jàya dih-i 
Afijanà-DAT/GEN.sG hand-PERr.M.sc birth day-Loc.sc 
‘They handed him (the baby Hanuman) to Afijanà on the day of his birth’ 
(Paumacariu 1.19.11.6) 


In clauses such as those in (19) and (20), the only overt non-subject argument carries 
DAT/GEN marking. Moreover, this pattern of marking does not differentiate between 
whether the subject carries ergative marking or is unmarked (nominative). 

Consider a learner that must arrive upon the case inventory of a language based on 
the observable input. The MIA system provides reliably present morphological evidence 
for nominative, ergative, and dative/genitive case but no reliable evidence for accusative 
case. It also provides robust data in which the only non-subject argument overtly ex- 
pressed in a clause carries case marking (the syncretic DAT/GEN marking). It is possible 
that the learner takes this subset of data as evidence for extending the DAT/GEN marking, 
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reserved for non-theme arguments, to theme and patient arguments as well. The Differ- 
ential Object Marking pattern evidenced in Early NIA emerges because the analogical 
extension of the overt DAT/GEN marking is constrained by the semantic properties asso- 
ciated with the original class of arguments marked by it — animacy and referentiality. If 
this hypothesis is correct, then we expect that there may be early data supporting this 
extension of DAT/GEN case marking to direct objects - in effect, the reanalysis of dative 
marking as accusative case, restricted to arguments meeting the criteria of high animacy 
and referentiality. 

In the previous subsection, it was claimed that as far as the MIA stage is concerned, 
direct arguments of canonical transitive verbs do not, for the most part, surface with 
DAT/GEN marking (examples in (18)). The caveat was provided precisely because the MIA 
stage itself seems to exhibit some data which is possibly analyzable as emergent DOM. 
The tentativeness with which this claim can be made emerges from three uncertainties 
about the data: (a) Although the lexical verbs appearing with the DAT/GEN marked ob- 
jects arguably have an argument structure corresponding to transitive verbs and their 
translational equivalents in English are realized as canonical transitives, given the se- 
mantics of these verbs, it seems possible that they pattern either with ditransitives or 
with “reciprocal” verbs” or with intransitives having accusative goal arguments in San- 
skrit. Thus, it is necessary to investigate more closely whether these cases are early 
DOM- instances or whether they should be reclassified as exhibiting previously occur- 
ring patterns (b) The object-marking pattern is very infrequent outside of the class of 
double-object verbs, other ditransitives, and “reciprocal verbs”. (c) There is absolutely 
no example of perfective clauses with ergative subjects in which the object appears with 
DAT/GEN marking. 

It is possible therefore that the human-denoting DAT/GEN marked NPs in the data 
below are not the theme/patient arguments in a standard transitive template as they 
appear to be; they may be better analyzed as recipient or goal arguments. I will leave 
the adjudication of this issue for further research. But regardless of their status, they 
provide further surface evidence to the language acquirer for an object marking case 
"accusative" in the language. 

In (21) and (22), we see that the human-denoting non-subject arguments of the tran- 
sitive verbs khama ‘forgive’, pekkha ‘look at’, garaha ‘denounce, curse’, abhitta ‘attack’, 
dhukka ‘approach’ and bhid ‘battle’ appear with DAT/GEN marking. The examples in (21) 
contain non-perfective clauses ((21b) is an imperative) while those in (22) illustrate the 
argument realization pattern in perfective clauses. 


(21 a. ekkavàra ma-hu khama-hi bhadar-a 
one.time I-DAT/GEN.sG forgive-IMP.2.G warrior-voc.sc 


‘O warrior (Lakshmana), please forgive me one time’ (Paumacariu 3.44.4.7) 


b. sundari pekkhu pekkhu jujjh-ant-aho 
beautiful.one.voc.sc see.IMP.2.sG see.IMP.2.sG fight-PART.DAT/GEN.SG 


*O beautiful one, look at the battle? (Paumacariu 2.31.12.3) 
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c. ema jàma garah-anti jinind-aho asanu 
thus when denounce-1MPF.3.Pr jininda-DAT/GEN.sG seat.NOM.M.SG 
cal-iu tàma dharanind-aho 
shake-PERr.M.sc then dharaninda-DAT/GEN.SG 


"When they were denouncing Jininda thus, the seat of Dharaninda started to 
shake: (Paumacariu 1.2.14.5) 


d. ham abbhitt-ami düsan-aho 
I.NoM.SG attack-1MPr.1.sc Düsana-DAT/GEN.sG 


‘I will attack Düsana' (Paumacariu 2.40.4.10) 


(22) a. dha-iu ankusu lakkhan-aho abbhi-ttu 
run-PERF.M.SG ankusu.NOM.M.SG laksmana-DAT/GEN.sG attack-PERF.M.SG 
lavanu ran-e ram-aho 


lavana.NoM.M.SG battlefield-Loc.sc rama-DAT/GEN.SG 


'Ankusa ran to Lakshmana (while) Lavana attacked Rama’ (Paumacariu 


5.82.14.13) 

b. kattha vi bhad-aho sivangana 
some.place PTCL warrior-DAT/GEN.sG she-jackal-group.NOM.M.PL 
dhukk-iya 


approach-PERF.M.PL 


‘At some places (on the battlefield), she-jackals approached the (dead) 
warriors. (Paumacariu 1.17.13.8) 
c. indai bhid-iu samar-e 
battle-PERF.M.sG battlefield-Loc.sc 
hanuvant-aho 
Indai.NoM.M.sc hanuvant-DAT/GEN.SG 


‘Indai (Indrajit) battled with Hanuvanta in the battlefield? (Paumacariu 
3.53.10.9) 


3.4 The DOM pattern in Early New Indo-Aryan 


Turning to the Early New Indo-Aryan stage (illustrated here with Old Marathi), we see 
a clearly established animacy- and referentiality-sensitive DOM pattern in both ergative 
and non-ergative clauses from the earliest period. The syncretic pAT/GEN marking of 
MIA appears as a generalized oblique case and it is augmented with innovated postpo- 
sitions that correspond to accusative and dative case markers. This trajectory, in which 
the MIA case-system reduces to a nominative/oblique contrast and new postpositions are 
innovated to convey the semantic and structural information associated with the older 
cases, is shared across Indo-Aryan languages (Masica 1991; Bubenik 1996; 1998 a.o). 


12 This period is represented here by two texts — Lilacharitra (ca. 1286 CE, prose) and the Dnyaneévari (ca. 
1287 CE, verse). 
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Direct objects in Old Marathi surface with an innovated postpositional accusative 
clitic, -tem, attached to the oblique stem (the reflex of the MIA DAT/GEN marker). The ex- 
amples selected for presentation here contain transitive verbs whose animate-denoting 
theme arguments in both ergative and non-ergative clauses appear with overt accusative 
marking in (23)-(25). 


(23) a. àmhim tuma-tem ne-unum 
I.NOM.PL you.PL-ACC take-FUT.1.PL 
"We will take you (to Varanasi). (Lilacaritra 1.25) 

b. aisem mhan-auni yà-tem Srikari-m ` dhar-üni apuleya 
thus speak-cER this.oBr-Acc hand-1ns.sc hold-ceEr sell opt 
ghara=si ne-lem 
house.OBL=DAT take-PERF.N.SG 


"Having spoken thus, taking him by the hand, she took him to her house: 
(Lilacaritra 1.34) 


(24) a. mhanoni prakasa=ce=ni=hi dehabal-em na dekh-ati 
therefore light.oBL=OF=BY=PTCL strength-INS.sG NEG see-IMPF.3.PL 
ma-tem 
LoBL-AcC 


‘Therefore, even by the strength of light, they do not see me? (DnydneSvari 
7.25.158) 


b. tehim yam=tem upariye-varauni dekh-ilem 
he-ERG.SG this.M.sG-Acc upper.storey.oBL-from.top see-PERF.N.SG 
‘He saw this one from the upper story (of the house)? (Lilacaritra 1.6) 
(25) a. ani te ama-tem dhari-ti 
Andthey.NoM.PL we-AcC — catch-IMPF.3.PL 
‘And they (honorific) would catch us? (Lilacaritra 1.18) 
b. eki-m akas-im sūryā=teņm dhar-ilem 
one-ERG.SG sky-Loc.sc sun.OBL=ACC catch-PERF.N.SG 


‘Someone (might) catch the sun in the sky.’ (Dnydnesvari 10.0.37) 


The examples in (26) contain the same non-animate denoting but referential argument 
jaga ‘world’ that also receives accusative marking in both imperfective and perfective, 
ergative clauses ((26a) and (26b) respectively). 


(26) a. maga apu-lem kelem phokar-iti ani jaga=tem 
then self-GEN.N.sc deed.NoM.N.sc proclaim-ImPF.3.PL and world-Acc 
dhikkar-iti 
denounce-IMPF.3.PL 


‘Then they proclaim their own deeds and denounce the world? (Dnyanesvari 
16.10.328) 
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b. prabamdhavyaj-em jaga-tem raks-ilem jana 
literary.work.IiNsT.sc world-Acc save-PERF.3.N.SG know.IMP.2.SG 


‘Know that (the Guru) has saved the world through this literary work? 
(Dnyane$vari 18.78.1765) 


It is necessary to take a much closer look at the pattern of DOM seen in Old Marathi 
languages and compare it on a verb-by-verb and argument-type by argument-type basis 
with the MIA pattern. It is only such an investigation that can accurately establish the 
nuanced differences between the impoverished accusative marking of Late MIA and the 
innovated accusative marking of Old Marathi. Noteworthy is the fact that no reflexes 
of the MIA accusative marking survive in the pronominal system of Old Marathi; only 
traces of the syncretized GEN/DAT marking remain. 


4 Conclusion 


At first glance, the presence of accusative marking (DOM) in NIA ergative clauses could 
be considered to be a case in which an existing template from the imperfective domain 
is extended by analogy to the perfective ergative domain. However, a closer study of 
the case-marking patterns of Late MIA reveals that there is no evidence for any direct 
extension of the MIA accusative marking to ergative clauses. It is more likely the case 
that the DOM pattern emerges in NIA languages as a reanalysis of the MIA DAT/GEN 
marking that appears systematically on a specific subset of non-subject arguments into a 
marker of accusative case. This reanalyzed accusative case is attested in both ergative and 
non-ergative clauses in the earliest texts of Old Marathi, supporting the hypothesis that 
accusative marking in ergative clauses is not part of any “de-ergativization” trajectory 
in the history of Indo-Aryan but rather an emergent effect of across-the-board changes 
in argument realization options for the languages. 


Abbreviations 

Glosses are as follows. “-” stands for a morpheme boundary, “=” for a clitic boundary. 
ABL ablative IMPF  imperfective 

ACC accusative (Old Indo-Aryan Present) 
AOR aorist IMPFCT Old Indo-Aryan Imperfect 
DAT dative INF infinitive 

ERG ergative INST instrumental 

F feminine LOC locative 

FUT future M masculine 

GEN genitive N neuter 

GER gerund NEG negation 

IMP imperative NOM nominative 
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16 On mechanisms by which languages become [nominative-Jaccusative 


PASS passive PROG progressive 

PERF perfective PV verb particle 

PFCT perfect SG singular 

PL plural SYNCR  syncretic (NOM/INST) 
PTCL discourse particle voc vocative 


PTCPL participle 
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Chapter 17 


OF as a phrasal affix in the English 
determiner system 


Randall Hendrick 
University of North Carolina at Chapel Hill 


This paper examines the distribution of of in the complex determiner system of English 
nominal expressions. The hypothesis is advanced that of in degree phrase inversion con- 
structions (expressions like more influential of a book) is a phrasal affix in the sense of 
Anderson’s classic work asserting the autonomy of morphological theory. The phrasal affix 
analysis can be distinguished from syntactic accounts that treat of as a syntactic head of 
phrase. It is argued that the phrasal affix account can capture of e sensitivity to morpholog- 
ical sub-category features and to second position more naturally than the syntactic head of 
phrase analysis. In contrast to the syntactic head analysis, the phrasal affix analysis also 
has the advantage of explaining why of fails to exhibit typical syntactic properties such as 
selection, dislocation, conjunction, and adjunction of focus particles. The analysis extends 
neatly to the use of of in complex determiners involving fractions. 


1 Introduction 


Anderson (1992) has argued that the classical distinction between words and affixes can 
be profitably extended to the phrasal domain, a line of argumentation that is extended in 
Anderson (2005). On this view, some properties of constructions might be better under- 
stood if analyzed as involving affixes attached to phrases, parallel to the attachment of 
affixes to words. Anderson identifies the possessive ’s as the prototype of such a phrasal 
affix, but (special) clitics more generally are identified as phrasal affixes in this sense. 
In the same spirit of enlarging the explanatory work of morphological theory into the 
domain of phrasal syntax, Anderson (2005) argues carefully that some verb second phe- 
nomena should be given a morphological explanation parallel to second position clitics. 
Both claims have proven provocative because they seem to compete with other rather 
popular analyses that make use of syntactic movement.! In this paper I will argue that of 


! The true difference between these types of analyses may be less than meets the eye if one assumes a 
syntactic engine like that advocated in Chomsky (1995). There is a real substantive difference in the embrace 
of optimality theoretic explanations in Anderson (2005), but I will ignore that issue here in the belief that 
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as it occurs in the pre-determiner system of English provides compelling corroboration 
for the attempt to extend the range of morphological analysis into at least some areas of 
phrasal syntax because alternative syntactic explanations for these instances of of are 
relatively weak. 

The center of our attention is on the formative of as it occurs in sentences like (1)- 
(2). These sentences appear to have some string before the indefinite D a(n) as part of 
the bracketed nominal constituent. They have been the focus of attention in a number 
of works, including Bolinger (1972), Bresnan (1973), Hendrick (1990), and Kennedy & 
Merchant (2000), among others. Kim & Sells (2011) present a wide variety of naturally 
occurring examples of the construction. A range of proposals have been offered to pro- 
vide syntactic accounts of such sentences. Such sentences seem closely related to (4), in 
which the adjective abuts a directly without the cushioning of of, and (3), in which the 
adjectival constituent appears to the right of a rather than to its left. Kennedy & Mer- 
chant (2000) label (4) and (1) as inverted DegP (i.e., inverted degree phrases), suggesting 
that they originate as (3) and subsequently move to a higher position. 


(1 


) It is difficult to find [ more influential of a book ] 
(2) [How long of a vacation ] did she take? 
) 

) 


— 


3) It is difficult to find [ a more influential book ] 
(4) It is difficult to find [ more influential a book ] 


The degree phrases that are implicated in this inversion are semantically fairly het- 
erogenous: a great many, like (1), appear to be evaluative in the sense of Keenan & Faltz 
(1985), but others, like (2), are extensional and amenable to a model theoretic interpreta- 
tion. 

There is another family of constructions that exhibit of preceding indefinite [p a(n)]. 
These are the complex determiners that have figured prominently in the study of the 
semantics of determiners and general quantifiers. Keenan & Stavi (1986) noted a range 
of complex determiners as part of their attempt to establish the semantic conservativity 
of a large set of determiners. Many of those determiners include of to the left of a, and 
denote proportions or frequencies, as noted in Peters & Westerstáhl (2006). The examples 
in (5) list some of the complex determiners involving of ? 


(5) a They sang for half of an hour. 
b. Sheate two thirds of an orange. 


c. The tornado damaged one quarter of an entire neighborhood. 


the existence of phrasal affixes is conceptually independent of any commitment to optimality theoretic 
explanations. 

? Peters & Westerstahl (2006) observe sentences like two of ten interviewed persons have not answered the 
question. The analysis in the text does not generalize to such an example because it does not exhibit [p 
a(n)]. Unlike the examples examined in the text, a plural NP is involved here as well. The of in this example 
would have to be accounted for separately. The fact that such examples can occur with out before of in 
contrast to (1) could be evidence supporting this inference. 
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My principal claim is that of is inserted in (6) as an affix to D, formally marking the 
relation between the head D and its specifier when o is phonologically non-null. 


(6) DP 


Ber ee 
DegP D 


This affixed of is evinced in all of (1), 2 and (5). As a marker of a formal relation, 
it itself is not semantically evaluated, and in this sense, it is not meaningful. As an 
adjunct, It is also syntactically inert for movement processes that target specifiers, heads 
or complements.? 

I have framed my claim in terms of the DP hypothesis in (6), where D is the head of 
the entire nominal phrase. If one were instead to assume the NP hypothesis, where N is 
the head of the nominal phrase, we would have structures like (7). In this case we would 
say that of is adjoined as an affix of D, formally marking the relation between D and the 
phrase a to its left when o is non-null. 


(7) NP 


2 OF as a phrasal affix 


There is a prima facie case to be made that of in examples like (1) and (5) are phrasal 
affixes. The insertion of of is sensitive to morphological properties and shows second 
position effects common to other putative examples of phrasal clitics that Anderson 
identifies. 


2.1 OF as sensitive to morphological properties 


The inversion of degree phrases in examples like (1) does not apply generally. It is un- 
natural if the degree phrases are an instance of unmodified synthetic comparatives, as 
illustrated in (8). 


3 See Kaufman (2010) for a broadly similar analysis of Tagalog and other Austronesian clitic systems. In 
lexical theories of the sort that Anderson has generally favored, the syntactic inertness of the phrasal clitic 
will follow from the post-syntactic affixation of the phrasal clitic. 
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(8) a Weare shopping for a better/prettier/finer vase. 
b. * We are shopping for better/prettier/finer a vase. 


c. * We are shopping for better/prettier/finer of a vase. 


When synthetic comparatives are modified, inversion of degree phrases, as in (9), is 
acceptable for some speakers, although others reject the inversion of degree phrases 
even when modified. 


(9) a. * We have never seen a any better/prettier/finer vase. 
b. We have never seen any better/prettier/finer a vase. 


c. We have never seen any better/prettier/finer of a vase. 


Speakers who reject all synthetic comparatives in (8)-(9) treat inversion of degree 
phrases as sensitive to that morphological class. Speakers who accept (9) in contrast to 
(7) appear to treat the inversion of degree phrases as sensitive to the prosodic contour 
of the phrase. Degree phrases composed of a single prosodic word, as in (8), are ex- 
cluded. When more prosodic material is added, the same synthetic comparatives appear 
acceptable once again, as in (9). 

The inversion is also sensitive to the morphological properties of the D to its right. 
The D cannot be definite, as shown by (10b) and (11b). Of the indefinite D's, it is only 
able to employ a(n), and resists co-occurring with some or no. 


(10 a. How long a novel did she write? 
b. * How long the novel did she write? 
c. *How long some novels did she write? 
d. * How long no novel did she write? 

(11) a. She wrote more fascinating a novel this time. 
b. *She wrote more fascinating the novel this time. 
c. *She wrote more fascinating some novels. 


* She wrote more fascinating no novel. 


If a(n) has the features [+INDEF, -PLURAL] while some is [+INDEF, +PLURAL] and 
no is [-INDEF] but lacks a feature specification for plurality, we can stipulate for the 
explicitness of description that degree inversion is restricted to D's that carry the feature 
specification +INDEF, -PLURAL.* 


^ Tt is worth noting that there is another formative some that is stressed and is definite. In addition, the 
table is offered only to make explicit the morphological use of indefinite in the text. It is not a theoretical 
claim. If we were to decide that we needed to specify no as PLURAL we could appeal to a feature CARD for 
cardinality and stipulate that degree phrase inversion could only occur with D's specified as -CARD. By the 
same token, we might have reason for adopting a morphological theory that avoided features altogether. 
In any scenario we need to restrict Degree Inversion to the D a(n) and not other members of the D class, 
much as /n/ is added to /a/ and not other indefinites. 
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Table 1: Co-occurrence of indefinite D’s with count nouns 


Indefinite D | Singular N Plural N 

a a novel "a novels 
some *some novel | some novels 

no no novel no novels 


2.2 OF as sensitive to second position 


The of described in the preceding subsection is not capable of bearing stress. Nor is it 
able to appear initially in a nominal domain. When it surfaces, it always occupies second 
position in the nominal expression. To see this point, consider sentences like (12) that 
contain two pre-nominal adjectives. If long is part of a wh-phrase, it preposes (as in 
(12b)), and if romantic is part of a degree phrase, it may prepose, as in (12c). However, if 
both are before a, the result is the unacceptable (12d). 


(12) 


Jane Eyre is a long romantic novel. 


a 
b. How long of a romantic novel is Jane Eyre? 


o 


jane Eyre is more romantic of a long novel (than Middlemarch). 


R- 


* How long more romantic of a novel is Jane Eyre? 


We begin to capture these facts if we assume that the preposed wh-phrase in (12b) and 
degree phrase in (12c) target a (unique) specifier position for movement (i.e. specifier of 
DP). We might accomplish this by way of (13a) that gives a(n) an EPP feature, parallel to 
what is often assumed for T. We could then posit a spell out rule that prefixes of. 


(13) a. There is an optional feature on [p a] that requires a filled specifier. 


b. This feature is optionally spelled out as the phrasal prefix of. 


3 Could OF be a syntactic head of phrase? 


Some researchers have suggested that the of that occurs in the Degree Inversion struc- 
tures is a syntactic head of phrase? On this view we should assign how long of a vacation 
the structure in (14), whereas the proposal I have suggested in (13) is (15), if we assign of 
to the category F and adopt an item and arrangement approach to affixation in order to 
facilitate the comparison of the two theoretical viewpoints. 


5 This structure is advocated in Kennedy & Merchant (2000) and is assumed in other work, for example, 
Borroff (2006). The labels of constituents differ in other analyses, in particular Matushansky (2002) and 
Kim & Sells (2011), but the constituency is broadly the same. The crucial point is to contrast analyses that 
treat of as a head of phrase from the analysis defended in the text where of is a phrasal affix. 

In a word-and-paradigm approach of might not be given a structural position at all but simply spelled out 
at the left bracket D. This point of view is probably closer to Anderson's. I believe that the structure in (15) 
gives some insight into why of occurs between the D and its specifier, as I explain at the end of this paper. 
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(14) FP 
DegP F 
EE, 
how long "int 
| di 
of D 
D NP 
| 
a Pon c 
DegP N 
pr | 
howHong N 
| 
vacation 
(15) DP 


DegP 


D 
— nn uu Teens 
F D 


how long 


D NP 
| 

a pod 
DegP N 

Que Sy | 
how long N 

| 

vacation 


Some authors, following the lead of Bennis, Dikken & Carver (1998), have tried to treat 
of as parallel to a copular construction in which predicates have been claimed to invert 


in (16). 


(16) a. Jill was a much better friend. 
b. A much better friend was Jill. 


Den Dikken (2006) offers an extended analysis in this vein over a range of languages. 
However, I find compelling the argumentation in Heycock & Kroch (1999) that rejects 
the inversion analysis in (16) in favor of viewing these as identity statements. On this 
view there is no basis for the parallel between (16) and (14), as Heycock and Kroch note. 
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Four reasons lead us to avoid a structure like (14)." First, heads of phrases are able to 
stand in a selection relation with other heads of phrases. Yet, there is no lexical head 
that ever selects the FP of (14). This fact is accidental if of is a head of phrase, but it is a 
necessary consequence if it is not actually a head, as in (15).? Second, the of phrase in 
(14) is immune to syntactic dislocation. Other of phrases are able to prepose or postpose, 
as shown in (17) and (18). It is also possible to strand of in examples parallel to (19). In 
this sense, they are syntactically active. 


(17 a. Three of these books have been reviewed. 
b. Ofthese books three have been reviewed. 

(18) a. The announcement of the Nobel Prize in Chemistry was delayed. 
b. The announcement was delayed of the Nobel Prize in Chemistry. 


(19) What book did the newspaper publish a review of? 


Yet (14) stands in contrast to examples like (17) in being unable to be preposed, post- 
posed, or stranded. They appear to be syntactically inert.’ 


(20) a. More extensive of an experiment was in the planning stages. 


b. *Ofan experiment more extensive was in the planning stages. 


(21 a. More extensive of an experiment was delayed 


b. * More extensive was delayed of an experiment 


(22) "What (kind of) experiment did the journal report more extensive of? 


However, if of is not a head of phrase with its own phrasal projection as in (15), the lack 
of syntactic dislocation is expected. 

A third challenge to the analysis in (14) involves conjunction. Locative prepositions 
like on can be repeated members of a conjunction with only, presumably for pragmatic 
reasons related to defeasing conversational implicatures that would otherwise obtain. 
The same seems true of of that introduces the complement of derived nominals like 
destruction. 


7 Some work has suggested that the fronted how long must originate as a predicate adjective. See, for example, 
Wood & Vikner (2011) and Troseth (2009). This seems unlikely, at least for English. Expressions like a 
beautiful dancer are ambiguous between a predicative reading in which the dancer is beautiful and a non- 
predicative reading in which the dancer dances beautifully. The ambiguity is preserved in how beautiful (of) 
a dancer. This fact is surprising if the preposed degree phrase could only arise from a predicate adjective. 
These facts are general and can be reproduced with a skillful manager. 

5 This challenge can be given a sharper formulation if we accept the stipulation that the complement of V is 
reserved for selected arguments of V. This stipulation requires that V select of, if of is a head of phrase. To 
the extent that every V that selects DP also selects this of we miss a regularity in the system of selection, a 
cost that is not incurred if of is provided as a phrasal affix. One can imagine mechanisms that allow of, if it 
were a head of phrase, to inherit argument properties of its complement DP, but such mechanisms would 
obscure the generalization that of is syntactically inert generally. 

? A reviewer observes that this second challenge can be met by stipulating that only maximal phrasal pro- 
jections are available for movement. While this defense of the head of phrase analysis will side step the 
second challenge, it does not generalize to meet the other challenges. For this reason the phrasal affix 
account seems to me to offer a more straight forward explanation, all other things being equal. 
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(23) a. You may place the notebook on, and only on, the desk. 
b. The tornado caused the destruction of, and only of, one residence. 
c. You may place the notebook only on the desk. 


d. The tornado caused the destruction only of one residence. 


In contrast, the of that appears with degree phrase inversion is unable to appear in simi- 
lar conjunctions. If of is syntactically and semantically inert, as the phrasal affix analysis 
contends, it could not conjoin and co-occur with only to defease any conversational im- 
plicatures. 


(24) You can’t find more valuable of a proposal. 


a 
b. * You can’t find more valuable only of a proposal. 


o 


* You can't find more valuable of, and only of, a proposal. 


œ 


How long of a vacation did they take? 
e. * How long of, and only of, a vacation did they take? 


A fourth reason to be skeptical of the analysis in (14) is that it is unable to generalize 
to explain the appearance of of in fractions. Fractions have been treated as complex 
determiners since the classic work of Keenan & Stavi (1986) on the conservativity of de- 
terminers. Fractions exhibit this semantic property, a fact that is easy to explain if they 
are (complex) determiners, but that looks accidental if they are assimilated to construc- 
tions that take of to be a head of phrase, like the partitive constructions discussed in the 
next section. The use of of in the fraction in (25) parallels its use in degree phrase inver- 
sion. In both cases of is adjacent to the indefinite D a(n), in both cases of is optional, 
and in both cases of lacks the ability to be lexically selected, or syntactically preposed 
or stranded as illustrated in (25). The phrasal affix analysis in (15) can provide a unified 
analysis, on the assumption that fractions occur in specifier of DP.!° 


(25) The recipe called for one half of a cup. 


TOP 


* Of a cup the recipe called for one half. 


o 


* A cup the recipe called for one half of. 


R- 


* The recipe called for one half of, and only of, a cup. 


4 Partitive OF is a syntactic head of phrase 


There is a second construction involving of which should be distinguished from of asa 
phrasal affix in Degree Inversion structures. This second type of construction is exem- 
plified in (26) and has traditionally been labeled a partitive. 


10 Tt is tempting to extend this analysis of fractions to proportions like six of ten adults don’t vote. Proportions 
like this do not make use of the D a(n) and are plural rather than singular. Without a more detailed analysis 
ofthe proper structural analysis of cardinals like ten, and an understanding of why proportions admit some 
form of dislocation (e.g. (out) of (every) ten adults six don't vote) the extension of the analysis will have to 
remain incomplete. 
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(26) a. She invited ten of the students to the party. 
b. Jack offered to buy any three of her paintings. 


c. Sandy knows the answer to the last (one) of those questions. 


(27) DP 


The of in these partitives is a syntactic head of phrase, appearing in structures like (27), 
where of heads a PP. With the exception of half (of) the, partitive of appears obligatorily, 
like other heads of phrases and in contrast to the pattern of of with Degree Phrase 
Inversion. In addition, the of phrase can be topicalized as a syntactic constituent. In this 
respect it differs from of in Degree Inversion constructions like (20). 


(28) a. Of the students, she invited ten to the party. 
b. Of her new paintings, Jack offered to buy any three. 


c. Ofthose questions, Sandy knows the answer to the last (one). 


The of phrase can also be co-ordinated, a standard test for constituency. Examples are 
somewhat limited, presumably because of semantic restrictions on partitives. However, 
naturally occurring examples are not difficult to find." 


(29) a. If you accept it to be a part of you and of your life, you gain control of the 
illness. 


b. I try to be very aware of all of the students and of their strengths and weak- 
nesses. 


In contrast, Degree Inversion structures do not show the same support for co-ordination. 


(30) a. * More expensive of a house or of a car would be hard to find. 


b. * How good of a novel or of a play did she write? 


It is also possible to use expressions like only with partitive of, in contrast to the of in 
Degree Inversion structures. 


(31) a. She invited ten ofthe linguistics students, and only of the linguistics students, 
to the party. 


b. * How good of a novel, and only of a novel, did she write? 


H (29a) occurs at https://www.sicknotweak.com/2016/08/its-a-piece-of-you-make-it-yours/ as accessed 
September 24, 2016. (29b) occurs at http://www.ballet-dance.com/200505/articles/GloriaGovrin20041200. 
htm and was accessed September 24, 2016. 
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My claim that of in Degree Inversion structures is structurally distinct from of in 
partitives can be better appreciated if it is contrasted with the classic analysis presented 
in Bresnan (1973). Bresnan suggests that of should be inserted in examples like (33) by 
an elegant rule like (32).? 


(32) O0- of /Q_DN 

(33) She has enough of a problem as it is. 
(34) more of an egg 

(35) more of the egg 


(32) is similar to the phrasal clitic analysis I suggested earlier in that of is inserted rather 
than being treated as a syntactic head of phrase. It differs from the phrasal clitic analysis 
in two ways: (i) it does not require Q to be complex, and (ii) it does not require D to be 
a(n). 

The first of these differences is more apparent than real. I say this because Bresnan 
argues that more originates as a syntactically complex constituent -er much, and that 
this complex constituent is replaced in the course of the derivation with more. Bresnan 
analyzes enough in a parallel complex structure, with the difference that enough neces- 
sarily co-occurs with a null specifier. If this complex structure is correct, there is a point 
in the derivation where (34) is not a counter-example to our phrasal affix account of of; 
we only need to insure that the phrasal clitic of is inserted prior to -er much suppletion. 

Consider now the second way in which rule (32) differs from the phrasal clitic analysis 
that I suggested in §2. By inserting of before any D, the rule in (32) loses the important 
empirical generalizations about the way in which of in (35) patterns like a syntactic 
head of a phrase in terms of conjunction, the distribution of only, and the possibility 
of movement. It does not code naturally the way in which the of in Degree Inversion 
constructions patterns differently along these dimensions. In the absence of other ev- 
idence, considerations of simplicity would make (32) quite attractive, but in this case, 
that very simplicity obscures important ways in which of patterns in distinct, rather 
than haphazard, ways. Moreover these distinct patterns have a semantic reflex. Peters 
& Westerstahl (2006) argue that semantic definiteness is a necessary characteristic of 
partitives. From this perspective the distinction between (34) and (35) that the phrasal 
affix analysis encodes is a natural one. 


5 OF in complements of nominalization and syntactic 
Case theory 


Since Chomsky (1981) it has been popular to say that the presence of of after nominaliza- 
tions like picture in (36) is required to provide the nominalization’s complement (Mary 


12 Stockwell, Schachter & Partee (1973) also distinguished partitives with of from other constructions where 
of surfaces. Selkirk (1977) distinguishes true partitives from pseudo-partitives like two pounds of turkey in 
part by pointing to differences in the distribution of of in the two types of constructions. 

13 Q in (32) is intended to denote quantified expressions like more and enough. 
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in this example) with Case. The details of the required derivation have rarely been made 
precise.^ This imprecision is remedied in an interesting way in Kayne (2002). There, 
Kayne gives a careful description of the insertion of of in (36) and links it to an expla- 
nation for why the of in derived nominals like (36) can show stranding effects like (37), 
superficially in violation of subjacency requirements. 


36) John was admiring a picture of Mary. 
37) Who was John admiring a picture of? 


38) John was [vp admiring [pp a [wp picture of Mary ]]]]] 


Sak es pees peek 


39) John was [ of [xp K — of [yp admiring [pp a [wp picture Mary III 


If one thought that the structure of (36) was like (38) where of formed a constituent 
with the logical complement of picture, Mary, and was internal to the VP, one has trouble 
explaining in a non-stipulative fashion why (37) is not a subjacency violation (hence the 
debate between Bach & Horn 1976 and Chomsky 1977). Kayne's suggestion is that there 
is an abstract functional head K-of that is merged into the syntactic structure outside of 
the VP. On his assumption (defended in Emonds 2000) that every lexical noun requires 
Case valuation from an appropriate head, Mary will be unable to be so valued by the 
verb admiring. Instead it will be forced to move to specifier of K-of. The formative of is 
merged above KP and provides another specifier position for the remnant VP to move to. 
This derivation will provide the linear order observed in (36) without treating a picture 
of Mary as a constituent. In this way the challenge of (37) to subjacency is side-stepped. 

Kayne sees a fundamental unity between the presence of of in nominalizations like 
(36) and its appearance in constructions like (40) and (GOU? 


(40) He has lots of money. 
(41) They bought too big of a house. 


Kayne suggests that lots in (40) and big in (41) have a nominal property and, for that 
reason, have a Case feature that needs valuation. Because they intervene between the 
verbs has and bought and money and house respectively, they prevent the verbs from 
valuing the Case feature on those nouns. This intervention forces the need for of. The 
derivation of (40) and (41) parallels (39). A functional head of (and K-of ) is merged above 
VP. The nominals money and house raise to specifier of K-of where their Case features 
can be valued. The remnant VP subsequently moves to a specifier position on the left in 
order to get the appropriate left to right linear order. 

Kayne's analysis of (36) makes of deeply syntactic in the sense that it takes of to bea 
head of phrase that enters into selection relations and that also provides a landing site 
for other putative syntactic operations. It is also syntactic in the sense that, whether 
it is ultimately judged to be true or not, it ultimately aims to resolve a syntactic prob- 
lem (regarding the ability of (37) to avoid subjacency like island effects). The extension 


14 Harley & Noyer (1998) is a notable exception to this trend. 
5 Examples like (40) are the pseudo-partitives of Selkirk (1977). 
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of the analysis to the Degree Inversion construction in (41) is not deeply syntactic in 
the same sense: the of in (41) does not enter into other independently required syntac- 
tic operations. For example, it is unable to feed topicalization or clefting to strand the 
preposition: 


(42) a. (It is) Mary John was admiring a picture of. 
b. (Itis ) Money he has lots of. 
c. *(It is) a house John bought too big of. 


Further, the of in (41) is optional, while it is obligatory in (40) and (36). This fact leads 
Kayne to posit that in some situations we should stipulate that multiple Case evaluation 
is available, allowing both big and house to be Case valued by the verb, thereby circum- 
venting the intervention effect. We should also note that unlike (36), there is no syntactic 
problem that the Case-intervention account of (41) provides a resolution to. 

My claim is that there is no deeply syntactic foundation for the putative structure 
(14), and in this sense there is a fundamental asymmetry between the Case licensing of 
in (36) and the of that appears in Degree Inversion structures like (41). This lack of a 
deeply syntactic account of of in Degree Inversion constructions is what opens the door 
for the phrasal affix account.!° Parity of reasoning prompts the question of whether the 
analysis of of in degree inversion structures like (41) is deeply morphological in any 
comparable sense. I would like to suggest that, rather than simply being a competing 
analysis, the phrasal affix analysis of of is deeply morphological, offering a response to 
a fundamental morphological puzzle. From this vantage, the contention that there are 
phrasal affixes poses the puzzle of how these affixes interact with the syntax of phrases 
and whether that interaction is principled in any morphological sense. Of course, Ander- 
son's perspective on this puzzle is to say that phrasal affixes are syntactically inert be- 
cause such morphological operations are ordered as a block after syntactic operations by 
virtue of the architecture of grammars. This paper has offered a different view that does 
not necessarily assume a late, or even a block, ordering of phrasal affixation. The analy- 
sis of of as a phrasal affix advanced here treats phrasal affixes as structural adjuncts, and 
by virtue of that structural relation, as inert to canonically syntactic operations such as 
movement. 


6 Envoi 


This paper has provided an analysis of of as it appears in Degree Inversion construc- 
tions and in fractions. It has been suggested that of in these constructions marks a 
formal relation between the indefinite D a(n) and its specifier, and that structurally it is 


16 Tt is possible to posit a syntactic feature, say +EPP, on some instances of a(n) and stipulate that the feature 
triggers of as its spell out phonologically. While I think this style of analysis could be descriptively suc- 
cessful, it would not be either deeply syntactic or deeply morphological as I have been using these terms. 
It would stipulate that of appears to the left of a(n), for example, rather than linking its appearance to a 
second position effect. 
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a D adjunct. This occurrence of of is distinct from its presence after derived nominals, 
which has been attributed by a number of researchers to Case theoretic requirements, 
and the of that appears in partitive constructions. This paper has defended the spirit 
of Anderson's claim that phrases can be provided with inflectional affixes by showing 
that the claim can offer an explanation for phenomena that are, from the syntactic point 
of view, unprincipled. In the process it draws attention to the kind of phenomena a 
morphological analysis is well suited for: formatives that are syntactically inert but sen- 
sitive to morphological class features, prosody, and to linear position (second position). 
I have also suggested that phrasal affixes should have the syntax of structural adjuncts. 
Whether this last point is valuable depends on a substantive theory of adjuncts and a 
comparison of other putative instances of phrasal affixes. For example, it has been sug- 
gested in Chomsky (1986), and argued for in some empirical detail in McCloskey (1996), 
that adjunction to lexically selected phrasal projections is prohibited. If this adjunction 
prohibition is correct, and if phrasal affixes must enter a derivation as adjuncts, we ex- 
pect them to avoid adjunction to lexically selected phrasal projections. In effect, they 
must enter the derivation adjoined to a lower phrasal projection. I draw attention to this 
implication because it can provide some foundation for the second position effect that 
the phrasal affix of exhibits. The intuition here is that if of were adjoined to the DP, it 
would violate the prohibition on adjunction to a lexically selected phrase. It can only 
appear on the lower phrase, D in (6), in effect producing a type of second positioning. 
Alternatively, one could stipulate that the phrasal affix of carries a constraint against 
appearing on the left edge of a phrase. A similar tension between syntactic and morpho- 
phonemic edge effects is present in explanations of verb second phenomena in various 
languages, as Anderson (2005) has observed, especially in regard to the description of 
Swiss Rumantsch. The choice between these explanatory crossroads will have to remain 
undetermined for the moment since it will inevitably depend on assumptions about the 
relationship between hierarchical order (i.e. c-command) and linear order. In forthcom- 
ing work on linking elements in Austronesian compounding, I hope to produce some 
reason for preferring the explanation in terms of adjunction that I sketched here. 
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Chapter 18 


Split-morphology and lexicalist 
morphosyntax: The case of 
transpositions 


Andrew Spencer 


University of Essex 


One of Anderson’s many contributions to morphological theory is the claim that morphol- 
ogy is split between syntactically mediated inflection and lexically mediated derivation. In 
Minimalist morphosyntax all morphology is syntax. This means that the split morphology 
proposal is not meaningful for that model. In lexicalist models, however, the split morph- 
ology hypothesis manifests itself as a distinction between direct accessibility to syntactic 
representations (inflection proper) and lack of accessibility. However, there are construc- 
tion types which bring the inflection-derivation distinction into question. One of these is 
the transposition, as illustrated by the ubiquitous deverbal participle. This is a mixed cate- 
gory, being at once a form of a verb yet having the external syntax of an adjective. It is thus 
unclear which side of the split participles fall. Similarly, participles seem to be an embarrass- 
ment for the Word-and-Paradigm models of inflection which have become dominant since 
Anderson first introduced them to contemporary theorizing. This is because they seem to 
require us to define a "paradigm-within-a-paradigm" (or *quasi-lexeme-within-a-lexeme"). 


I provide an analysis of Russian participles within Stump's PFM2 model, deploying the 
model of lexical representation developed in Spencer (2013), which fractionates represen- 
tations into more finely grained subcategories than is usual. I take a participle to be the 
adjectival representation of a verb, coded directly by means of a set-valued feature, REPR. I 
show how a set of rules can be written which will define the adjectival paradigm as a set of 
forms belonging to the overall paradigm of the original verb lexeme. The rules define a par- 
tially underspecified lexical entry for the participle (“quasi-lexeme”), which has essentially 
the same shape as the lexical entry for an (uninflected) simplex adjective. Thus, the partici- 
ple’s lexical entry is that of an adjective, just as though we were dealing with derivation, 
but it realizes the verbal properties of voice/aspect and it shares its semantics and lexemic 
index with its base verb, just as in the case of verb inflection. The participle thus straddles 
the split, but in a principled fashion. 


Andrew Spencer. 2017. Split-morphology and lexicalist morphosyntax: The case 
of transpositions. In Claire Bowern, Laurence Horn & Raffaella Zanuttini (eds.), 
| On looking into words (and beyond), 385-421. Berlin: Language Science Press. 
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1 Introduction: Morphological architecture 


Since Anderson (1977) (re-)introduced to generative grammar the traditional notion of 
“word-and-paradigm”, and particularly the ground-breaking work of Matthews (1972) on 
Latin inflection, morphologists have been grappling with the challenge of providing an 
adequate characterization of the key notions “word” and “paradigm”. 

Central to this debate has been the fate of the Bloomfield/Harris interpretation of 
the morpheme concept (Anderson 2015) and the notion of “Separationism” (Beard 1995). 
The “word” notion presupposes (at least) a word/phrase distinction. (In morpheme-based 
approaches no such distinction is necessary and all morphology and syntax can be sub- 
sumed under a model of morphotactics.) There are very well known problems with any 
attempt to find necessary and sufficient conditions for the ‘concrete’ instantiations of 
word — the phonological word, (inflected) word form, syntactic word, even. This is gen- 
erally on account of incomplete grammaticalization, which strands constructions and 
formatives in a limbo between the status of function word - clitic - affix, compound 
element- affix, analytical syntactic construction - periphrasis and so on. However, the 
‘abstract’ notions of word are no less problematic, specifically the lexeme and the mor- 
phosyntactic word (i.e. an inflected word form together with the morphosyntactic prop- 
erty array that it realizes). Defining the set of morphosyntactic words often requires us to 
make decisions about what constitutes a word form, which brings us back to the issue of 
clitics, periphrasis and so on. It also requires us to make sometimes arbitrary decisions 
about morphosyntactic property sets (MPSs) in the light of form-content mismatches 
such as (some types of) syncretism (Baerman, Brown & Corbett 2005), overabundance 
(Thornton 2012), defectiveness (Sims 2015), and deponency (Baerman et al. 2007), as sum- 
marized in Stump (20162). 

Inflectional properties, and word-oriented functional categories generally, such as def- 
initeness (for nouns) or modality (for verbs) in English, seem to presuppose an inflection 
~ derivation dichotomy that is notoriously hard to pin down. Broadly speaking it dis- 
tinguishes the creation of new lexical items/units (generally, Saussurean signs pairing a 
cognitive meaning with a set of forms) from forms of a lexical item/unit. The component 
of grammar that defines new lexical units or lexemes is derivational morphology. How- 
ever, as Spencer (2013) itemizes in some detail, such a (canonical) inflection/derivation 
dichotomy represents just two poles of a scale of types of lexical relatedness. Some of 
the intermediate types of relatedness pose problems for any clean characterization of 
the lexeme concept (part of the problem of "lexemic individuation", Spencer 2016b). 

One way of characterizing the core of the inflection/derivation distinction is the no- 
tion of split morphology (Anderson 1982: and subsequent references). The essence of the 
distinction can be thought of as an interface claim: inflection interfaces directly with syn- 
tax (‘inflection is what is relevant to syntax’). The obverse to this claim is that derivation 
interfaces with the lexicon, in the sense that derivation is what gives rise to expansion 
of the lexical stock (as well as defining relatedness between already fixed lexical entries), 
in other words derivation is ‘what is relevant for the lexicon’. Anderson implements 
this architectural claim by saying that it is the rules of syntax themselves which define 
inflectional morphology. 
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This move raises the important question of what model of syntax we are presup- 
posing. Most versions of the Minimalist Program presuppose something very close 
to the model of morphotactics proposed by the American Structuralists: the atoms of 
representation are morphemes, morphology and syntax are identical (it is therefore a 
terminological choice whether we think of sentence construction as morphotactics or 
syntax), and the notions lexeme, word form, inflection, derivation, inflectional para- 
digm are at best heuristic descriptive terms which cannot be given a coherent defini- 
tion within the model. The natural syntactic framework for investigating a word-and- 
paradigm, or rather, lexeme-and-paradigm approach to inflection is, perhaps, a lexical- 
ist, or constraints-based, model (Miller & Sag 1997; Sadler & Spencer 2001; Sadler & 
Nordlinger 2006). In that case the question of split morphology takes on a somewhat 
different aspect. Rather than claiming that syntactic rules construct inflected forms as 
such, we must say that inflected forms, compared with derived lexemes, are permitted to 
interact in specific ways with syntactic representations, or equally that inflected forms 
bear properties which are visible to syntactic representations and principles. 

The obvious way to implement this idea is to say that the abstract characterization 
of an inflected word includes a morphosyntactic description which overlaps with that 
of a corresponding syntactic representation. A concrete version of this type of overlap 
is seen in the form-content mapping, as defined in Stump’s notion of paradigm linkage 
(Spencer & Stump 2013; Stewart & Stump 2007; Stump 2002; 2006; 2016a,b). Stump dis- 
tinguishes morphological properties, the FORM paradigm, from syntactic properties, the 
CONTENT paradigm. By default these are homologous, but there are many instances of 
mismatch. For example, Latin syntax distinguishes singular and plural number and a va- 
riety of cases, including dative and ablative, but those two cases are never distinguished 
morphologically for any lexeme in the plural. On the other hand, Spanish verbs have 
two distinct subparadigms for the imperfect subjunctive but that distinction is nowhere 
reflected in the syntax. Other mismatches include deponency and periphrasis. To a lim- 
ited extent we can say that the form-content paradigm distinction is a reflex of covert 
split morphology: such a distinction is not definable for derivational morphology. 

In lexicalist models, the derivational morphology ~ lexicon interface operates over 
property sets which don’t play a direct role in syntax. The hedge “direct” is important: 
typically, derivation is relevant to syntax, in the sense that it changes a lexeme’s mor- 
phosyntactic class. More subtly, derivation may make appeal to argument structure 
realization (witness English Subject Nominalizations, able-Adjective formation and so 
on). But if it is assumed that lexemes have a representation of their word class argument 
structure and other relevant properties, then such syntactically expressed relations can 
be defined over lexical representations, as extensively argued in the constraints-based 
literature (see Wechsler 2014: for a review). This is effectively a statement of the doc- 
trine of lexical integrity, at the abstract level of representation as defined by Ackerman 
& LeSourd (1997). 

The conclusion to be drawn is that morphology interfaced with a constraints-based 
syntax needs to be split in essentially the way proposed by Anderson, but as an abstract 
architectural property, which sometimes bears a rather complex relation to concrete mor- 
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phophonological expression. Inflectional morphology maps to syntactic representations 
in a way in which derivational morphology is unable to, while derivational morphology 
serves to define (specific kinds of) lexical relatedness. However, there remain interest- 
ing cases of violations of lexical integrity with derivational relations, in which syntax 
appears to have access to the internal structure of the derived word. This paper will ar- 
gue that such phenomena require us to extend the scope of the split in morphology in a 
way that takes the notion of "lexeme" as syntactic atom seriously, and which ultimately 
provides conceptual motivation for a lexicalist interpretation of split morphology. 

A case in point is the class of denominal (relational) adjectives in many languages, 
which allow one noun to modify another by taking on the morphosyntax of an adjective. 
In European languages, including English and Russian, such adjectives respect lexical 
integrity, in the sense that the base noun is opaque to syntax. For example, the base 
noun kniga ‘book’ in the Russian relational adjective kniznyj ‘pertaining to a book/books' 
is opaque to agreement, government or any other syntactic process, just as in English 
the noun tide in tidal is opaque. For instance, the phrases poderZannaja kniga 'second- 
hand book’ and kniZnyj magazin ‘book shop’ do not gives us "poderZannaja/poderzannyj 
kniznyj magazin, and although we can say high tide and tidal barrier we can't say "high 
tidal barrier. The importance of these observations is that there are languages in which 
just such attributive modification into a derived adjective is possible (see the discussion 
of Tungusic and Samoyedic examples in Nikolaeva (2008), and also the detailed discus- 
sion of the Samoyedic language Selkup in Spencer 2013: Chapter 10). 

The relational adjectives of Russian and English, however, share one important prop- 
erty with the Tungusic and Samoyedic pure relational adjectives, namely, they have 
precisely the same lexical semantics (cognitive content) as the base noun. This leaves 
us with the question of how to explain why in some languages relational adjectives are 
opaque and in others they are transparent to attributive modification. 

Spencer (2013) argues that the crucial difference between true transpositional rela- 
tional adjectives of, say, Selkup, and the "fake" transpositions of English/Russian is that 
true transpositions are effectively forms of the base noun lexeme, while the English and 
Russian relational adjectives are distinct lexemes, though ones which have a semantic 
representation identical to that of their base, what Spencer (2013: 275) calls a transpo- 
sitional lexeme. Other types of transpositional lexeme include English property nom- 
inalizations (kindness, sincerity, ...), deverbal nominalizations such as destruction, and 
participial forms which have been converted into qualitative adjectives such as (very) 
boring/bored, charming, excited, .... A relational adjective which is a true transposition 
permits inbound attributive modification because it is, in an important sense, still a noun, 
just as a noun stem marked for number, case, possession or definiteness is still a noun. 

One consequence of this reappraisal of the morphology~syntax interface is that the 
crucial divide can no longer be straightforwardly equated with a traditional distinction 
between inflection and derivation. It is not appropriate to think of a deverbal partici- 
ple or a relational adjective as merely an inflected form of a verb or noun, because that 
participle or adjective will in general inflect like an adjective, not like a verb/noun. How- 
ever, following Haspelmath (1996), Spencer (2013: Chapter 10) argues for an enrichment 
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of the traditional notion of the inflection paradigm to include an attribute REPRESEN- 
TATION (taken from Russian descriptive practice). Thus, a participle is the adjectival 
representation of the verb, and as such it can have its own adjectival inflectional para- 
digm. It thus has the outward appearance of an autonomous lexeme, but appearances 
are deceptive. Rather, the transposition is a *quasi-lexeme", and the transpositional rela- 
tionship therefore represents a particularly striking instance of a deviation from inflec- 
tional canonicity.! Like transpositions, these are not usually described under the head- 
ing of morphology-syntax mismatches, and like transpositions they are not discussed in 
Stump's (2016a) otherwise very detailed survey. 

In the model of lexical representation argued for in Spencer (2013) the notion "form of 
a lexeme" in this somewhat extended sense is reflected very simply: all forms of a lexeme 
share their Lexemic Index. This leads us to propose a (no doubt too strong) hypothesis 
about the nature of the split in morphology: 


Principle of Lexemic Transparency 

Let D be a word derived by some regular morphological process from 
a word B, possibly of different morphosyntactic category. If mor- 
phosyntactic processes treat D in the same manner as they would 
treat the base word, B, even where the category of D is such that we 
would not otherwise expect it to be subject to such processes, then D 
is a form of the lexeme B (shares B's Lexemic Index). 


Ishall argue that the architectural equivalent of splitting inflectional from derivational 
morphology is this modification of the notion of lexical integrity: if morphology defines 
a set of forms of a lexeme, rather than defining a new, autonomous lexeme, then those 
forms will show lexical transparency. Derived lexemes, however, show lexical opacity 
(one reflex of which is the more familiar property of lexical integrity). This paper will 
illustrate that proposal on the basis of the behaviour of Russian deverbal participles. 
These are particularly useful. First, Russian adjectival morphosyntax is very clearly dis- 
tinguished from noun or verb morphosyntax, so it is easy to show that the participles be- 
have like adjectives. Second, the Russian past tense and conditional mood are expressed 
by a form which is historically a participle and which show participle-like agreement, 
but which has been reanalysed as a verb form (the l-participle). This contrasts in impor- 
tant ways with the true participles. Third, like many languages, Russian often converts 
its participles into true qualitative adjectives. These have (almost) exactly the same set 
of forms as the participles but their syntax is no different from that of a simplex adjec- 
tive. The true participles are like the l-participle in showing lexical transparency with 
respect to the base verb. They therefore both appear on the inflectional side of the split 
morphology. This is because they are forms of the verb’s paradigm, and do not constitute 
independent lexemes in their own right. They contrast with the converted participles, 


! Other such deviations are certain forms of evaluative morphology (cf "the diminutive form of a noun") and 
grammaticalized argument structure alternations (cf "the passive/anti-passive/applied/causative form of a 
verb’). 
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which are autonomous lexemes and hence opaque with respect to the verb properties of 
their (etymological) base lexeme. 


2 Lexical representations and lexical relatedness 


2.1 Introduction 


Our discussion will require us to be explicit about a number of aspects of lexical repre- 
sentation and the way that words, in the broadest senses of the term, are related to each 
other. I shall adopt a generalized form of Stump’s (2001) Paradigm Function Morphol- 
ogy, which I have called Generalized Paradigm Function Morphology, GPFM (Spencer 
2013). The GPFM model is designed to permit us to use the machinery of PFM to de- 
scribe types of lexical relatedness which go beyond normal inflectional morphology. It 
thus extends the lexical representations that morphology has access to by incorporating 
representations of syntactic properties and the lexical semantic representation of words. 
In GPFM lexemes have to be individuated by means of an arbitrary index, the Lexemic 
Index (defining something like the key field in a database). One of the reasons for this 
is because it is arguably not possible in the general case to individuate lexical represen- 
tations of lexemes in terms of any of the linguistically relevant properties that can be 
ascribed to a lexical representation. In addition, however, the index serves an important 
role in distinguishing certain types of morphological relatedness. 


2.2 Lexical representations 


I begin with the descriptive representational apparatus required to characterize an in- 
flected word form, taking inflection to be an uncontroversial category for the sake of 
exposition. I then generalize the representational format to provide a characterization 
of the lexemic entry. 

A word has a minimum of three contentive attributes (together with a fourth, its Lex- 
emic Index, LI): FORM, SYN(TAX), SEM(ANTICS). The SYN attribute records idiosyn- 
cratic selectional or collocation properties, but its main component is the argument 
structure attribute, ARG-STR. This records thematic argument arrays in the standard 
fashion (notated here as x, y, ... variables). However, it also includes a semantic function 
(sf) role. 

For nouns and verbs the sf roles are the “R” and “E” roles respectively, familiar from 
the literature. The ^R" (for “referential”) argument is canonically associated with lexical 
entries whose SEM value belongs to the ontological class of Thing. It thus identifies those 
predicates that typically denote (concrete or abstract) objects and that can serve as the 
lexical head of a referring expression, i.e. a canonical noun. Thus, the “R” argument of 
TREE? corresponds to the “x” variable in the semantic representation Ax. TREE(x). It is the 
argument that is the target of attributive modifiers: (tall) tree (Spencer 1999). See Lieber 
(2004: 16; 55-59) for concrete examples of the R role being deployed in morphology. The 


? Where relevant, I adopt the standard convention of putting the name of a lexeme in SMALL CAPS. 
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"E" (for ‘event(uality)’) argument (sometimes written as "e" or “s” (for "situation")) is 
canonically associated with lexical entries whose SEM value belongs to the ontological 
class of Event. It thus identifies those predicates that typically denote states or events 
(eventualities) and that can serve as the lexical head of a clause, i.e. a canonical verb. 
Thus, the “E” argument of FALL corresponds to the “e” variable in the (neo-Davidsonian) 
semantic representation Ae,x.FALL(e,x). For attributive modification (principal role of 
the traditional adjective class) I assume a semantic function role labelled “A”. This is 
coindexed to a noun's R sf role to represent attributive modification. All adjectives which 
function as attributive modifiers, including relational adjectives and participles, have an 
"A" semantic function role. 

I assume the SEM attribute is essentially a formula in predicate calculus defined over 
the ontological types Thing, Event, Property (Jackendoff 1990), and perhaps others, cor- 
responding loosely to the morphosyntactic categories of Noun, Verb, Adjective. I re- 
main here agnostic as to whether the categories N, V, A are universal and if so, in what 
sense. I assume that some languages also have a category of Adverb, and also transposi- 
tional morphosyntax, adjective-to-adverb (as in English ly-suffixation), verb-to-adverb 
(gerund) and noun-to-adverb (found in Selkup, for example), but I will not have much 
to say about that category here. 

Adpositions may mandate a further ontological category of, say, Relation, but I ig- 
nore that too. The SEM attribute can be thought of as a label for an encyclopaedic 
representation, such as Ax.CAT(x) or Ax,y.WRITE(x,y), but sometimes including linguis- 
tically encoded information relevant to semantic interpretation that cannot simply be 
consigned to an undifferentiated encyclopedia, for instance, Ax,y,5.SIMILAR_TO(x.y,5) 
^ CAT(y) ^ DIMENSION(8), ‘similar to the property of “cat” in some dimension, 8’, or 
Ax,y. AGAIN(WRITE(x y)) ‘to re-write something’. 

The FORM attribute is essentially a record of the word's morphology. Assuming an 
articulated inflectional system, complete with arbitrary inflectional classes and possibly 
other purely morphological, paradigm-based properties, the FORM attribute needs to 
specify all the information needed to locate the word form in the appropriate inflectional 
paradigm. The first property is the morpholexical category, MORCAT. This will typically 
be derived by default from the syntactic category of the representation (the SYN|CAT 
attribute)? but that default mapping is not infrequently overridden, sometimes in rather 
complex ways. 

The next property is largely defined by reference to the syntactic category of the word 
form/lexeme, namely, the morpholexical signature, MORSIG. This specifies all those mor- 
phosyntactic properties for which an element of that MORCAT is obligatorily inflected. 
An inflected word has to have a specification of the morphosyntactic property set (or 
sets), MPSs, that it realizes. In the case of syncretism this may be a (natural or unnatu- 
ral) class of MPSs. For example, a Russian adjective is obligatorily inflected for at least 


? Given the complexities of category mixing it is better to dispense entirely with morphological or syntactic 
category labels such as *verb", "adjective". The required lexical classes can be defined over other aspects of 
representation much more efficiently and it is not difficult in constraints-based models to ensure that those 
aspects of representation are accessible to rules of morphosyntax. However, for convenience of exposition 
I will continue to talk of (morphological or syntactic) verbs, adjectives and so on. 
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the properties of number, gender and case, and these features are therefore listed in the 
MORSIG (a gradable adjective is also inflected for comparative and superlative forms). 
We will see in §5 that the conception of MORSIG assumed in Spencer (2013) can be en- 
riched and extended to include the FORM-CONTENT paradigm distinction introduced 
in Stump (2006) and subsequent work. 

Finally, the representation has to specify a phonological form for the word, through 
an attribute FORM|PHON. The precise characterization of the PHON entry, in general, 
will be given by the rules of inflectional morphology. I will assume that one aspect of the 
PHON representation will be a specification of the STEM on which the inflected form is 
based, but I ignore this refinement because it will not be relevant to the question of split 
morphology. 

The actual inflected word forms of the lexeme are defined by inflectional rules, which 
apply to the pairing (£, c), where o is a complete, permissible feature set for the lexeme 
with Lexemic Index £. The lexemic representation has to include all the idiosyncratic 
morphological information relevant to a lexeme's realized paradigm. In the next subsec- 
tion I summarize the way that the Paradigm Function can be generalized to define not 
only inflection but all the systematic forms of relatedness. 

One aspect of these representations is worth noting. In keeping with the inferential- 
realizational assumptions underlying our model of inflection the SEM attribute remains 
constant for all inflected forms. In particular, there is no characterization at the level of 
the lexical representation of a word form (much less the level of lexemic representation) 
of the semantics of, say, tense or number. What this means is that in the syntax the 
VP which is headed by a past tense form verb may, ceteris paribus, be interpreted as 
referring to an event situated prior to speech time. However, since ‘past tense’ forms are 
also used in sequence of tense constructions, irrealis conditionals and so on, ‘past time’ 
is only the default interpretation. 


2.3 Lexical relatedness 


We can now ask what types of systematic relatedness lexemic entries (i.e. lexemes) can 
exhibit. Spencer (2013) argues extensively that we can find pretty well all the logically 
possible types as defined by the very crude but simple artifice of defining non-trivial 
differences in the four principal attributes, FORM, SYN, SEM, LI. For instance, if we 
consider pairs of representations of distinct lexemic entries, £1, £2, i.e. those with distinct 
LIs, then we can identify several different types of relatedness (usually all treated as 
derivation). 

Suppose that the lexemes £4, £ are distinct in FORM, SYN, SEM representations. Sup- 
pose also that the FORM/SEM representations of £; subsume or properly include (in 
some sense) those of £1. Then we have standard (canonical) derivational morphology, 
DRIVE — DRIVER. Languages sometimes define derived lexemes without changing the 
FORM attribute at all, however. A case in point is that of adjectives which are converted 
wholesale to nouns without any change in morphology (Spencer 2002; see also the dis- 
cussion of Angestellte(r) nouns in Spencer 2013). We will later see examples of derivation 
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in which FORM/SYN/LI attributes are changed but without changing the meaning (trans- 
positional lexeme). 

Now let’s consider what happens if we keep the LI constant, that is, we consider intra- 
lexemic relatedness. To begin with, let us assume that only the FORM attribute can 
change. In the canonical case this is the same as inflectional morphology. Ignoring for 
the present the transpositional interpretation of the participles as a verbal noun or as 
verbal adjectives, we can say that all inflected forms of sinc are forms of a verb. We have 
to be a little cautious when referring to syntactic properties: the syntactic distribution of 
any given inflected form is, in general, distinct from that of other forms. The 3sg subject 
agreement form of a verb does not occur in the same syntactic positions as the 3pl form. 
The properties that give rise to these differences, however, are precisely the MPSs which 
bifurcate into FORM/CONTENT paradigms. This means that we must enrich lexical 
representations in the obvious way: FORM paradigms are defined over features typed 
as FORM features, and CONTENT paradigms are defined over features typed as SYN 
features, with the proviso that the two sets of features are identical by default. Modulo 
the CONTENT paradigm, then, in canonical inflection the SYN value of a given word 
form is identical to that of the other forms of that lexeme. This means that most inflection 
is what Booij (1994) would call contextual inflection.* 

Likewise, the lexeme sinc denotes the same event type in all of its inflected forms, 
and in that sense all word forms share the same SEM representation. In Spencer (2013) I 
argue that there are certain types of inflection that enrich the semantic representation of 
the base lexeme, whilst remaining inflectional. Certain kinds of Aktionsart marking, as 
well as semantic case marking often have this characteristic, as do causative argument 
structure alternations. In traditional descriptions of languages with such inflection we 
often find terminological vacillation, as linguists are unsure whether to label, say, the 
iterative form or the causative of a verb inflectional or derivational (and similar problems 
afflict evaluative morphology). 

The GPFM descriptive framework proposed in Spencer (2013) generalizes the PFM 
model so that all forms of lexical relatedness, from contextual inflection to derivation, 
are defined over four principal attributes of a lexical representation. This requires us 
to generalize the notion of the Paradigm Function to that of a Generalized Paradigm 
Function (GPF), which is like the Paradigm Function except that it consists of four com- 
ponent functions, fform» fsyn» fsem fli. For canonical derivation the GPF introduces non- 
trivial changes to all four components (including the LI). For the converted adjectives 
and Angestellte(r) nouns the fform function has no effect (it can be thought of as a kind 
of identity function). For most inflection, the fsyn,sem,1; functions are the identity func- 
tion, because the GPF simply realizes inflectional properties of the lexeme at the FORM 
level. 

In the GPFM model the lexemic representation is defined in terms of the Lexemic Index 
and a completely underspecified (empty) feature set, u, for example, (PUT, u), a special 
case of the GPF. This maximally underspecified GPF defines just those properties of a 


^ This includes Booij's parade examples of inherent inflection, past tense and plural number. See Spencer 
(2013: 77-82) for critical discussion of Booij's distinction. 
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lexeme (identified by its Lexemic Index) that are completely idiosyncratic and completely 
unpredictable. However, although such a representation reflects the traditional notion 
of a maximally compact, non-redundant dictionary entry, it is not a representation that 
can serve as the direct input to rules of inflection. This is because the lexemic entry 
has to be specified for those inflectional properties that it can and must inflect for. This 
set is defined by the morphological signature, MORSIG. In Spencer (2013: 199) I make 
this rather obvious point explicit in the Inflectional Specifiability Principle. In the current 
context this can be stated as follows: a lexeme is inflected for a given MPS, iff that MPS 
is defined in its MORSIG. 

In Spencer (2013) I treat the MORSIG attribute as part of the FORM paradigm of a 
lexeme. However, we know that the FORM and CONTENT paradigms of a lexeme can 
differ substantially. For this reason, it is necessary to enrich the SYN attribute of a lex- 
emic entry with a (possibly distinct) MORSIG attribute. The values of the (FORM and 
CONTENT) MORSIG attribute are for the most part predictable from other aspects of 
the lexical representation. First, the FORM MORSIG attribute is generally projected from 
the CONTENT MORSIG and by default the two attributes are identical. Second, to some 
extent the meaning of the lexeme can determine the content of the MORSIG attribute. 
Most importantly, however, the MORSIG (which, recall, is essentially a record of the 
properties for which a lexeme inflects) is largely projectable from various aspects of the 
SYN attribute, notably the ARG-STR attribute. Thus, a lexeme with the SYN|ARG-STR 
value (E( x, ...)) (ie. a verb) will by default have the syntax and morphology of a verb. 
The lexemic representation needs to be enriched to include purely idiosyncratic informa- 
tion, such as irregular stem forms, irregular inflections, defective cells or subparadigms, 
and so on. Technically, this can easily be achieved in GPFM by defining very specific 
functions for particular properties over the LIs of the lexemes concerned. For instance, 
the irregular past tense of PUT can be defined by a function defining the STEM,,; form for 
the pairing (Pur, u): GPF((rur, {STEM)s:[PHON})) = /pot/ or similar. This will override 
any less specific (in practice, any other) statement of past tense morphology. Similarly, 
a defective lexeme such as ronco (lacking a past tense form) will have a tense-specific 
GPF((ronco, {tense:pst })) = undefined. This again will override any other statement, in- 
cluding the GPF((Go, {tense:pst })) = /went/, which applies to one other verb based on 
GO (cf underwent) and hence is less specific. 

The role of the MORSIG attribute can be simply illustrated by the English plural. Any 
lexeme with the SYN|ARG-STR|(R ) value licenses MORSIG|num:{sg, pl}, provided that 
its SEM attribute specifies it as a count noun. For a mass noun the MORSIG value will 
be just num:{sg}, while for a plurale tantum noun it will be num:[pl). The {pl}, resp. {sg} 
values for such nouns are therefore undefined, so that a GPF((siNcERITY, {num:pl})) or 
GPF((scissors, {num:sg})) will not correspond to any legal output. 

In summary, I assume a representation for a dictionary entry of a very traditional kind: 
it is minimally redundant, specifying just the unpredictable phonological and semantic 
information, together with any morphosyntactic information that cannot be projected 
from the phonological and semantic specifications. For an entirely well-behaved lexeme 
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belonging to the default inflection class for that word category, this is all the information 
that is required, but that is only because the morphological signature can be projected 
from that entry too. 

To all intents and purposes GPF collapses with PFM for most cases of inflectional 
morphology. However, the model is designed to cover all types of lexical relatedness, 
up to regular derivation, but using essentially the same machinery as PFM. For instance, 
for the derivation of a Subject Nominal such as DRIVER from DRIVE we would have the 
partial GPF shown in (1), where V is the Lexemic Index of a verb lexeme and ô is the 
derivational feature which defines the Subject Nominal formation process. 


(D a fj((V, 8) = A(V), where A is a function over LIs corresponding to the deriva- 
tional feature 6 


b. fsem((V, 5)) = [Thing Ax, PERSON(x) ^ f?'(x)], where P” is a suitable form of 
the semantic representation of the lexeme V 


€ fsyn((V, 5)) =u 
d. fro, (CV, 5)) = Z@er, where Z is the ó-selected stem form of V 


This corresponds to a novel lexemic entry which is exactly like that in (1) except that it 
is defined over the pairing (ACHT, u). If DRIVE is the LI of the verb Drive, then A(‘V) is 
the LI of the derived lexeme DRIVER and (A(V), u) defines its lexemic entry. 

The representation in (1) lacks any syntactic specification. This is because that speci- 
fication can be given by a default mapping from the SEM attribute, by virtue of the De- 
fault Cascade (Spencer 2013: 191-194). Under that principle, regularly derived lexemes 
have their principal morphosyntactic properties projected from their semantic represen- 
tations, in accordance with the notional model of parts of speech. Thus, DRIVER is of 
ontological category Thing and therefore by default has the semantic function role R. I 
assume that the SEM attribute includes an indication that the lexeme denotes a countable 
Thing so that the lexeme licenses the full MORSIG[NUM:sg, pl}. However, the lexeme in 
(1) is derived, not simplex. In Spencer (2013) I propose a Category Erasure Principle, un- 
der which the morphosyntactic properties of a base lexeme in derivation are deleted so 
that they can be overwritten by the Default Cascade. However, given maximally under- 
specified lexemic entries to start with this is probably not necessary: the word formation 
rule interpretation of the GPF takes the only information it has available in a lexical en- 
try, that is the phonology of the root and the meaning, and modifies it, say, by adding an 
affix, and by enriching the SEM representation systematically, for instance, by adding 
a predicate. The Default Cascade then specifies the underspecified properties, includ- 
ing the two MORSIG representations. If the base lexeme’s entry includes non-default 
specifications such as irregular inflected forms or non-standard argument realization, 
these will override the Default Cascade. Illustrative examples of simple inflection and 
derivation are provided in the Appendix. 

I have sketched the GPFM approach to inflection on the one hand, and standard der- 
ivation on the other hand. Non-standard types of derivation are handled by overriding 
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some of the defaults in the GPF. However, this still leaves us with the somewhat intrigu- 
ing, but very widespread set of relatedness types in which FORM/SYN attributes are 
altered, much as in derivation, but without changing either the SEM representation or, 
crucially, the LI attribute, one of the class of relatedness types often called a “transposi- 
tion”. The parade example of this type is, perhaps, the deverbal participle, the ‘adjectival 
representation’ of a verb. 

Transpositions pose particular difficulties for a simple interpretation of the inflection- 
derivation divide (and, indeed, for any architecture with the equivalent of split morph- 
ology): on the one hand, a participle is generally taken to be ‘a form of’ the base verb 
(regular participles are never given their own entry in a traditional dictionary, for in- 
stance). On the other hand, a participle is morphologically and syntactically a kind of 
adjective. If morphology is split, on which side do transpositions fall? 


3 Paradigm Linkage: the problem of transpositions 


Spencer & Stump (2013) (see also Bonami & Stump 2016; Stump 2016a) describe a vari- 
ant of the Paradigm Function Morphology model, PFM2, which explicitly distinguishes 
two types of paradigm. FORM paradigms determine the mapping between MPS’s and 
the word forms realizing them; CONTENT paradigms determine the set of grammatical 
distinctions a lexeme needs to make in a syntactic representation. The two paradigms 
are related by rules of paradigm linkage. In the default case the two paradigms align per- 
fectly: a morphological plural noun is syntactically plural and vice versa. However, there 
are numerous mismatches. A simple case in point is provided by English perfect and pas- 
sive participles. These have distinct syntax (they collocate with different auxiliaries, and 
only the passive participle can be used attributively), but they are never distinguished 
morphologically. Thus, the (single) FORM property (say, VFORM:en-ptcp, or the output 
of a function fen, as in Aronoff 1994) has to map to two CONTENT properties. Stump 
(2016a) provides an extensive survey of the principal types of mismatch encountered in 
the world’s inflectional systems. 

I will argue that we can regard the FORM/CONTENT paradigm distinction as a reflex 
of split morphology for constraints-based lexicalist models of syntax. The reasoning 
is very simple — no corresponding FORM/CONTENT distinction can be mandated for 
derivation. Indeed, given standard PFM2 assumptions it is difficult to imagine what such 
a distinction could mean. 

Incorporating the FORM-CONTENT paradigm distinction into GPFM has consequen- 
ces for the way in which lexical representations are organized. We have seen that the 
FORM attribute of a lexical representation is defined in part in terms of its MORSIG, 
which determines precisely those properties a lexeme must inflect for. In the original 
GPFM model the MORSIG attribute is only defined at the level of FORM paradigms, 
therefore. Since CONTENT paradigms are not always congruent to FORM paradigms 
we will need to draw the appropriate distinction in lexical representations. The obvious 
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way to do this is to assume that each lexeme is associated with two (at least) MORSIG 
attributes, one of which is a value of the FORM attribute and the other of which is a 
value of the SYN attribute. We can call these f MORSIG, c-MORSIG respectively.” 

Stump (2016a: 253) defines paradigm linkage in terms of the Paradigm Function and 
a Corr function. Given a lexemic index £, a set of MPSs o, v, and a stem form, Z, the 
function Corr(( £,0)) delivers a form correspondent (Z,t), such that if PF((Z,t)) = (w,t), 
then PF((£,o)) = (w,t). By default the FORM and CONTENT features are identical, that 
is the set o = q, as defined by a set of property mappings (pm), that is, pm(o) = o. How- 
ever, the function pm is called upon whenever there is a mismatch between FORM and 
CONTENT properties. This is the case, for instance, with deviations such as syncretism, 
deponency and so on. In the case of active- passive deponency the property mapping 
maps the morphological passive voice forms to the syntactic active paradigm and leaves 
the morphological active voice forms undefined. 

We can ask how paradigm linkage would work for derivation, by taking the example 
of a derivational feature, 5, such as the privative denominal adjective feature, privadj, 
which derives friendless from friend, as described in Stump (2001:252-60).5 Here, we 
are dealing with trivial (two-celled) paradigms, so that the featural mismatches of in- 
flectional paradigms will not be found, and we can work with just a single derivational 
feature, 5. 


(2) Let Corr((rRIEND, privadj)) be the correspondence function for privadj applied to 
the lexeme FRIEND. 
Corr({FRIEND, privadj)) = (Z,5), where Z = |friend|, the stem of FRIEND, and ô = 
privadj. If PF((Z,5)) = (w,5), then DEG Ai) = (w,5), hence, PF((FRIEND, privadj)) 
= (friendless, privadj). 


This seems very straightforward, but there is a hidden difficulty, not immediately ap- 
parent from a language like English, with limited inflection. Consider a hypothetical 
language just like English but in which nouns and adjectives have entirely different in- 
flectional paradigms, or, indeed, consider a derivational process such as that of Subject 
Nominalization which derives DRIVER from DRIVE. What we have to ensure is that the 
output lexeme is (automatically, by default) inflected as a noun, as opposed to the base 
lexeme which is inflected as a verb. As it stands, the Paradigm Function applied to a 
pairing of Lexemic Index and derivational feature will not deliver what Stump calls the 
realized paradigm of the output lexeme. At best, it might define the (or an) uninflected 
stem form of the derived word. Moreover, without significantly altering the nature of 
the Corr function it will be impossible to define additional lexical information such as 


5 A number of authors have proposed that cells in a lexeme's paradigm can be realized by multiword, pe- 
riphrastic, constructions (Sadler & Spencer 2001; Brown et al. 2012; Bonami 2015). Popova & Spencer 
(forthcoming), following suggestions by Bonami (2015), propose that such constructions demand addi- 
tional CONTENT feature sets specifically to define the content of the periphrastic expression, which is 
often at odds with the default feature content of the words which make up that expression. Periphrasis 
therefore provides substantial motivation for a FORM/CONTENT or m-/s-feature (Sadler & Spencer 2001) 
distinction. 

Stump does not discuss derivation, or, indeed, transpositions in his works on paradigm linkage. 
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inflectional class membership, idiosyncratic stem forms or other deviations from default 
inflection, or indeed any non-default or purely morphological property of the output.’ 

A direct solution to this problem might be to enrich the content of the derivational 
feature, 5, so that it incorporates the full set of paradigmatic oppositions realizable by 
the derived lexeme. Thus, the Subject Nominal feature (say subjnom) could be defined 
as the complex [subjnom, NUMBER:a]. This would mean that we would have to define 
the Paradigm Function so that it defines each inflected form of the output lexeme, rather 
than defining an underspecified lexemic entry, which then gets inflected by the stan- 
dard inflectional rules applying to words of that category. For instance, the Paradigm 
Function could take the form PF((DRIVE,{subjnom, NUMBER:sg})) = (driver, {subjnom, 
NUMBER:sg}), PF((DRIVE,{subjnom, NUMBER:pl})) = (drivers, {subjnom, NUMBER:pl]). 
We could call this approach the ‘full-listing approach’. 

Now, the full-listing approach would have the rather peculiar consequence that the 
word forms {driver, drivers } would be inflected forms of the verb DRIVE, and there would 
be no such thing as a DRIVER lexeme. Not only is this entirely counter-intuitive, and 
at variance with any sensible distinction between inflection and derivation, it becomes 
even more counter-intuitive when we see recursive derivation. It would entail that the 
noun reprivatizability was an inflected form of the adjective private. The problem is that 
the Corr function needs to be able to define not a cell in a realized paradigm (what syn- 
tacticians refer to, misleadingly, as the lexical entry of a word(form)), but it has to define 
a featurally underspecified lexemic entry (an autonomous dictionary entry in traditional 
terms). I therefore reject the full-listing approach in favour of that adopted in the GPFM 
model. 

We are now in a position to examine the case of transpositions. The problem we must 
address is how to ensure that the transposition is assigned to its own inflectional para- 
digm, proper to its new morphosyntactic category, whilst still in some sense remaining 
part of the inflectional paradigm of the base lexeme. Recall that I have proposed adopting 
a class of features, [REPRESENTATION:{...}] to define the paradigm space of a transpo- 
sition. I now consider the way this feature can be deployed to define the inflectional 
paradigm-within-a-paradigm of a transposition. Let p = [REPR:x] for some transposi- 
tional relation x, e.g. a verb-to-adjective (participle) transposition. I assume that the 
GPF applied to the pairing (£,p) defines a partially specified representation for the trans- 
position. Normally, when the Paradigm Function applies to a pairing of LI and MPS the 
MPS has to represent a complete and coherent set of features sufficient to define the 
inflected form. However, this is not strictly speaking a property of the PFM system it- 
self. In principle, we could define a partial paradigm for a lexeme by reference to just 
a proper subset of the features required to define any fully inflected word form. For in- 
stance, suppose that verbs in a language inflect for a variety of tense-aspect-mood-voice 
(TAMV) series, and that in addition they show subject agreement. We could, in principle, 
define the stem sets for the TAMV categories independently of the agreement morph- 
ology, by simply not specifying the subject agreement properties, effectively defining a 
set of "screeves" for the language (as in the Georgian descriptive tradition, Anderson 


7 It is also not clear how the additional semantic predicate of the output would be defined. 
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1992: 141-45). This makes use of the same notion of feature underspecification used in 
GPFM to define derivation, but relativized to a specific feature set. 

The GPF for a transposition has to specify the derived morphosyntactic category, x, 
and, by default, the f-/c-MORSIG attribute, associated with that category, together with 
additional morphological properties inherited from the base, such as TAMV properties. 
If the Paradigm Function were to apply in the same way for transpositions as for other in- 
flected forms then the Corr function would have to provide the form correspondents for 
each inflected form, but it would not be able to provide a lexemic entry for the uninflected 
transposition. In other words, standard application of PFM2 principles would give rise 
to a full listing interpretation of the paradigm. It would not be possible to provide any 
characterization of the transposition as a quasi-lexeme (see p. 389). This generates many 
of the same conceptual and technical problems as the application of Corr to derivation. 
One additional consequence is that it would be difficult to describe the very common 
situation in diachronic change in which a participle is reanalysed as a qualitative ad- 
jective, often without significantly changing the lexical semantics of the original verb 
lexeme (transpositional lexeme). If we treat the participle as akin to a lexeme, then we 
can easily define the diachronic reanalysis over that representation, just as we would for 
ordinary derivational conversion. One of the problems with the full-listing approach is 
that it is not clear how we could permit the transposition to inherit MORSIG properties 
of the derived category (Adjective in the case of participles). We therefore need to define 
the GPF in such a way that it allows us to define a quasi-lexeme as part of the paradigm 
of the base. 

First note that application of the Default Cascade to define derived categories is en- 
tirely excluded in the case of (pure) transpositions, since these preserve the meaning, and 
hence the ontological category, of their base; indeed, this is the whole point of a transpo- 
sition. The most natural assumption is that the transpositional morphosyntax effects a 
shift in the syntactic categorization and that morphological recategorization falls out as 
a consequence. (In fact, the extent to which the transposition acquires derived categorial 
properties and loses those of its base is subject to a good deal of cross-linguistic varia- 
tion, tempered by poorly understood typological tendencies. See Malchukov (2004) for 
discussion in the context of action nominal transpositions.) This is effectively a weaker 
instantiation of the Default Cascade. Following Spencer (1999; 2013), I will assume that 
the shift in syntactic representation is actually a modification of the argument structure 
representation; specifically, the definition of a complex sf role (see below). The crucial 
point is that the transpositional GPF changes the SYNCAT value of the base lexeme to 
that of a mixed category and this, ceteris paribus, will automatically entrain a shift in 
the morphological category and hence the MORSIG attributes. 

The basic machinery is only hinted at in Spencer (2013: Chapter ten). In order to de- 
velop an explicit account we first need to refine the definition ofthe REPR(ESENTATION) 
attribute. I will therefore assume that the REPR feature takes an ordered pair as value, not 
a singleton element, [REPR:(x,A) ], where x, A range over lexical categories. Thus, a par- 
ticiple represents a verb as an adjective and hence bears the specification [REPR:(V,A)], 
while a predicatively used noun will have the specification [REPR:(N,V)]. We then cou- 
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ple these feature values to syntactic category specifications in the obvious way, stated 
informally in (3). 


(3) Given [REPR:(«,A)], for A = N, V, A. 
Then SYNCAT = À, 
and by default, MORCAT = A 


In the GPFM framework the [REPR] attribute will be associated with an appropriate 
enrichment of the semantic function role of the ARG-STR attribute. A verb base ARG- 
STR includes E, the event semantic function role, (E, (x,...)). That of the participle is 
enriched by addition of the A semantic function role, to become (simplifying somewhat) 
(Ai (E. (xi....))), where the co-indexing indicates that the element of which the participle 
is predicated is an element of the argument array of the base verb lexeme (the highest 
such argument for Indo-European type participles, though not necessarily so in other 
languages). This representation reflects the mixed categorial nature of the participle. 
While its main external syntax (Haspelmath 1996) is now that of an adjective, it retains 
the eventive ARG-STR of the base verb, which permits it, on a language specific basis, 
to realize a number of verb properties. These typically include the realization of internal 
arguments as verb dependents (complete with quirky case marking). The E semantic 
function role also permits modification by adverbials targeting event semantics. 

Exactly which adjectival properties are acquired and, more crucially for participles, 
which verb properties are lost or retained, differs cross-linguistically. The Indo-European 
participle, for instance, typically retains the active~passive alternation, but may also re- 
tain aspect (Slavic) or tense (Sanskrit; Lowe 2015). As attributive modifiers, the Indo- 
European participles can only modify a noun which expresses the participle’s high- 
est (subject) argument, effectively making them into heads of subject-oriented relative 
clauses, but in other languages there is much greater freedom in the choice of argument 
that can be relativized on by the participle (for discussion see Spencer 2016c). 

In §5 ‘Paradigm linkage rules for Russian’ I present a more detailed analysis of Russian 
participles in which this basic schema for transpositions is expanded upon. 


4 Russian participles 


In this section we look at the set of four participles that are regularly associated with 
Russian verbs. Before we can consider these, however, we need to understand Russian 
verb inflection and the place of the participles in that system. I begin with an overview of 
the grammatical distinctions as a whole made by verbs, in other words, the CONTENT 
paradigm, before considering the actual morphological forms themselves. We encounter 
the familiar problem that there is no consensus on just what the oppositions are and 
how they relate to each other, and so some of my descriptive decisions will be motivated 
in part by expositional convenience. I illustrate with the second conjugation transitive 
verb UDARIT ‘hit’, whose imperfective aspect series is formed by shifting to the first 
conjugation, UDAR AT 
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The simplest categories are the infinitive and imperfective (“present”) and perfective 
(“past”) gerunds. These are indeclinable. Telic verbs such as UDAR'IT/UDAR AT, ‘hit’, 
require imperfective and perfective aspect forms for most of their paradigm. We can 
distinguish three moods, indicative, imperative, conditional. The imperative is straight- 
forward and I will ignore it here. The indicative distinguishes three tenses, present (for 
imperfectives only), past, future. I return to the conditional below. 

Transitive verbs show an active~passive voice opposition. However, the voice system 
is complicated by the fact that imperfective verbs are able to take the reflexive suffix 
-s'a/s'. This has the basic function of giving a reflexive/reciprocal meaning (though sim- 
ilar meanings can also be expressed with fully-fledged reflexive/reciprocal pronouns). 
The reflexive form also has a wide variety of other uses, including the passive (Gerritsen 
1990). This means that we should set up a reflexive voice category and define the imper- 
fective passive in terms of this, but I set this task aside since it is not directly relevant. 
The perfective verbs express the passive alternation periphrastically, with BE + perfec- 
tive passive participle. At the level of the CONTENT paradigm we need to be able to 
define [VOICE:{act,pass}] for both perfective and imperfective verbs, therefore. 

Present tense is expressed morphologically, but only for imperfective verbs. There is 
no dedicated tense marker, and in effect, present tense is realized by the person/number 
subject agreement morphology. The future tense is expressed periphrastically by imper- 
fective verbs, by means of BE + the infinitive form. For perfective verbs the future is 
expressed by a paradigm which is essentially the same as the present tense paradigm 
for imperfective verbs. Thus, the present tense of the imperfective verb pat ‘write’ 
is p isu, pises,... and the future tense of the prefixed perfective verb form NAP'ISAT is 
nap isu, nap ises,.... 

One of the main challenges of the Russian verb system is the representation of the 
past tense. This is derived historically from a periphrastic perfect tense series formed 
by BE and a resultative participle expressed by a suffix JL the l-participle. Syntactically, 
the I-participle behaved like a predicative adjective, agreeing with the subject in number 
and gender, but not in person. The auxiliary BE was lost, leaving the l-participle and its 
adjective-like agreement as the sole exponent of past tense. The agreement inflections 
on an l-participle such as (na)p'isal ‘wrote’ are almost identical to those on a predicative 
(short-form) adjective such as MAL ‘short’: (na)p isal, (na)p isala, (na)p isalo, (na)pisal'i vs 
mal, mala, malo, maly. The only real difference is that in the plural the I-participle stem 
is palatalized but not the predicative adjective stem: (na)p'isal'i vs maly. 

The simple way of analysing the past tense construction would be to take the -/ for- 
mative to be an exponent of past tense and define two distinct sets of subject agreement 
rules for present/future and past tenses. However, the l-participle has one other signif- 
icant usage which precludes this direct analysis. The conditional mood is expressed by 
means of the invariable particle by, a kind of freely distributed enclitic (it can occur any- 
where in the clause except absolute initial position). This can co-occur with the infinitive, 


(4). 
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(4) Esliby skazat’ pravdu, ... 
if BY sayawr truth 


"To be honest, ... 
However, it is much more often found with the l-participle, (5). 


(5) Esliby ty skaza-  pravdu,... 
if BY you say-r-Prc» truth 
‘If you told/were to tell/had told the truth, ..? 


As is indicated in the gloss, the conditional is tenseless, and serves as the translation 
equivalent of past or non-past conditional in English. The existence of the conditional 
construction means that we cannot regard the l-participle simply as the exponent of past 
tense. In fact, it is an instance of what, since Aronoff (1994) morphologists have called 
a "morphome', that is, a pure morphological form, which serves as a stem for building 
up inflected word forms, but which realizes no MPSs on its own and whose distribution 
is not mandated by any semantic, phonological or other non-morphological properties. 
The CONTENT paradigm for Russian is summarized in Table 1, but ignoring the true 
participles. 


Table 1: Russian verb CONTENT paradigm for UDARTT/UDARAT ‘hit’ 


ASPECT imperfective perfective 
INFINITIVE udar at udari-t 
GERUND udar a-ja udari-v(&i) 
IMPERATIVE udar a-j(te)! udar (te)! 
TENSE 

present udar a-ju, -e$, ... <none> 

future bud-u, -eš, ...udar'at udar -u, -iš 

past udar a-], -a, -o, -i udari-l, -a, -o, - i 
CONDITIONAL  udara-l-a,-o,-i- by  udari-l-a,-o,-i- by 
PASSIVE udar at sa, etc (byl) udaren, -a, -o, -y 


Iturn now to the morphological or FORM paradigm. We can divide Russian verb forms 
into five groups. The first is the infinitive and the second the set of two indeclinable 
gerund forms. The third is the set of finite forms, including the imperative mood. In 
practice, these are limited to the present (non-past) forms showing subject agreement 
in person/number. The fourth is the l-participle. Finally, we have the set of (declinable) 
active and passive participles discussed earlier. These are tabulated in Table 2. 

Summarizing Table 2, the verb can/must inflect for aspect. The verb system also shows 
voice alternations. However, these are effected either through reflexive morphology (im- 
perfective verbs) or periphrastically (perfective verbs) so voice proper lacks a dedicated, 
purely morphological exponent, except for the true participles. The infinitive, gerunds 
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Table 2: Russian verb FORM paradigm for UDARIT/UDAR AT ‘hit’ 


ASPECT imperfective perfective 
INFINITIVE udara-t udari-t 
GERUND udar a-ja udari-v 
IMPERATIVE udar a-j(te)! udar (te)! 
TENSE 

present/future ` udar'a-ju, -eš, ... udar -u, -i8 
L-PTCP udara-l-a,-o,-i  udari-,-a,-o,-i 
REFLEXIVE udar at sa, etc <none> 


and the l-participle do not show any tense oppositions. The subject agreement shown by 
l-participles differs from that of finite verb forms in that it is defined in terms of MPSs 
proper to predicative adjectives, not those of finite verbs. 


Table 3: CONTENT feature array 


ASPECT {ipfv,pfv} 

VFORM INFINITIVE 
TENSE:{prs, fut, pst} 
IMPERATIVE:{sg, pl} 
CONDITIONAL:{yes, no} 

REFLEXIVE f{yes, no} 

AGRSUBJ PERSON: 2, 3} 
NUMBER:{sg, pl} 
GENDER:{m, f, n} 

VOICE (ACTIVE, PASSIVE} 

REPR IVAN, (V,Adv)} 


In Tables 3, 4 I provide a summary list of the features which populate the CONTENT 
and FORM paradigms. I have provided only basic labels for the various MPSs. Ideally, we 
would want to know how they are grouped together, if at all, in the two paradigms. This 
is a difficult question, and I finesse it by just assuming what is effectively a list structure 
for the MPSs, with a number of dependency statements between them. Thus, I have 
not distinguished, say, a finite from a non-finite set of forms or constructions. However, 
for the purposes of giving a broad-brush characterization of the morphology~syntax 
mapping this is probably not a problem. 

From this overview we can see that there is a very clear divide between the CONTENT 
paradigm MPSs required to describe the system as a whole and the FORM paradigm 
MPSs required to describe the individual word forms. These tables ignore the inflectional 
paradigms of the participles, of course, but adding them will just serve to emphasize the 
CONTENT~FORM disparity. 
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Table 4: FORM feature array 


ASPECT {ipfv,pfv} 

VFORM INFINITIVE 
TENSE:{prs-fut} 
IMPERATIVE:{sg, pl} 
L-PTCP 

REFLEXIVE ` {yes, no} 

AGRSUBJ PERSON: 2, 3} 
NUMBER:{sg, pl} 
GENDER:{m, f, n] 

REPR IVAN, (V,Adv)} 


Russian has four participles, active~passive and perfective~imperfective, which are 
typical attributive modifiers with the agreement morphosyntax of standard adjectives. 
However, they retain a variety of verb properties, making them into typical examples of 
mixed categories. Thus, in (6a), the imperfective active participle upravl'ajuscij takes a 
temporal PP adjunct and assigns instrumental case to its complement, just like the finite 


verb (6b). 


(6) a. (éCelovek-a), upravl a-jus¢-ego v tecenie mnogo let 
(the.person[M]-GEN.SG) run-PRSPTCP-M.GEN.SG in course many of.years 
mestnoj  &koloj 
local.INSTR school.INSTR 
‘of (the person) (who has been) running the local school for many years’ 

b. Ivanov upravlaet v tečen'ie mnogo let mestnoj  &koloj 
Ivanov runs in course many of.years localiNsTR school.iNsTR 


‘Ivanov has been running the local primary school for many years’ 
The participles are often described as present/past tense forms, but their semantics is 
essentially aspectual and they fit somewhat better into the overall verb scheme if they 


are treated as perfective~imperfective pairs. They are summarized in Table 5 (cf Wade 
1992: 361). 


Table 5: Russian participles of the verb UPRAVIT/UPRAVI AT ‘control’ 


Aspect  imperfective perfective 


Active — upravlaju-8&- | upravi-vi- 
Passive  upravla-em- ` upravl-on(n)- 
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Perfective aspect participles, for semantic reasons, usually only have past time refer- 
ence? The imperfective participles realize a time relative to the main verb of the clause, 
so that ‘present tense’ is a particularly misleading label for these forms (Wade 1992: 375). 


(7) Ja videl/vizu sobak-u, bega-ju&é-uju po beregu 
I saw/see | dog[r]-Acc.sc run-ACTPRSPTCP-F.ACC.SG along the shore 


‘I saw/see the dog running along the shore’ 


The participle differs from the finite form in this respect. Example (8) would only be 
possible with either the meaning ‘(dog) which usually runs along the shore’ or as a 
somewhat marked form of the historic present (cf Russkaja Grammatika I: 665). 


(8) Ja videl sobak-u, kotor-aja begaet po beregu 
I saw dog[F]-acc.sc which-F.Nom.sc runs along the shore 


‘I saw the dog which is running along the shore’ 


In this respect, Russian participles are just like their English counterparts, of course. 

The participles have a number of properties aligning them with verbs. In addition to 
realizing the purely verbal (eventive) categories of tense-aspect-voice, the active partici- 
ples can take reflexive forms, either as reflexive variants of non-reflexives, inheriting all 
the semantics of the reflexive forms, upravl'at (s'a) ‘manage, control’ ~ upravl'ajusijs'a, 
or as inherent reflexives (with no non-reflexive counterpart), bojat’s‘a ‘fear’ ~ bojasci- 
js'a. As we have seen, syntactically, the participles retain the verb's argument structure, 
including quirky case assignment to complements, such as instrumental in the case of 
upravl ajuscijs‘a (see examples (6)) and genitive in the case of bojascijs'a. 

On the other hand, the participles have a number of adjectival properties. The most 
salient morphosyntactic property is that of attributive adjective agreement together with 
the morphological property of belonging to a well-defined adjectival inflectional class. 
(As attributive modifiers the participles can be restrictive or non-restrictive, like other 
attributive modifiers, including relative clauses.) However, they can also be used as pred- 
icates with the copula BYT ‘be’ or with “semi-copulas” such as sTAT ‘become’, KAZAT'S'A 
‘seem’, OSTAT'S A ‘remain’, and others. Most commonly it is the perfective passive par- 
ticiples that can be used as predicates but active participles can also be found in this 
role (Svedova 1980b: 291, 82346). The passive (though not the active) participles can also 
appear as predicates, in the so-called short form, a typically adjectival property.’ 


(9) uzin uze poda-n 
supper.M.sc already serve-PASSPTCP.M.SG 


‘Supper has already been served’ 


8 But see the counter-examples in the Academy of Sciences grammar Russkaja Grammatika, I: 667, 
pred javl ajuscij ‘presenting’, vzvolnujuscij ‘exciting’, sdelajuscij ‘doing’, smoguscij being able’. 

? Present active participles can be used in the short form, however, when they are converted into true, qual- 
itative, adjectives (Svedova 1980a: 666). 
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Given this brief descriptive summary of the basic facts of Russian participles we can 
turn to the central questions: how do we represent participles in a formal, constraints- 
based grammar with an inferential-realizational morphology? How do these representa- 
tions relate to the inflection~derivation divide and the issue of split morphology? 


5 Paradigm linkage rules for Russian 


In the extension to GPFM presented here, the FORM paradigm and the CONTENT pa- 
radigm are modelled by the attributes f- MORSIG, c-MORSIG. However, those types of 
lexical relatedness which modify the MPSs of a representation, such as transpositions, 
willipso facto modify the content of Stump's FORM/ CONTENT paradigms and the f-/c- 
MORSIG attributes. The GPF for such types of relatedness therefore has to specify that 
novel content, by stipulation, if necessary. In the rules I propose below I show how this 
can be achieved for Russian conjugation. The reference to MORSIG is taken to mean 
c-MORSIG by default and by default, this is identical to f- MORSIG. 

I shall begin by specifying the CONTENT and FORM MORSIG attribute for non- 
participial verb categories. We need to define the f- MORSIG in part in terms of the c- 
MORSIG and in part independently. The default is the identity mapping from c-MORSIG 
to f- MORSIG. The c-MORSIG list shown earlier in Table 1 is defined for any lexical rep- 
resentation whose ARG-STR includes the E semantic function role. The MPSs that are 
shared across the CONTENT and FORM paradigms of Russian verbs are fairly limited 
(cf. Tables 1 and 2 above). They are (ignoring for the present the participles and gerunds): 
ASPECT :(ipfv, pfv}, VFORM:{INFINITIVE, IMPERATIVE:{sg, pl}, VOICE:{act, pass}, and 
AGRSBJ:{PERSON/NUMBER/GENDER}. 

The c-features ASPECT, VFORM:(INFINITIVE, IMPERATIVE:{sg, pl}} have relatively 
straightforward f-feature correspondents. I shall ignore INFINITIVE and IMPERATIVE 
for present purposes. AGRSUBJ is also a FORM property but with some complications 
and I return to it when I discuss the l-participle. 

The status of the TENSE feature is a little unclear. At the CONTENT level there are 
clearly three values, prs, fut, pst, but only the prs and fut values have a FORM corre- 
spondent, and even then the value of the FORM correspondent is a composite prs/fut, 
and therefore not a direct correspondent of either c-[TENSE:prs] or c-[TENSE:fut]. The 
c-TENSE:{pst} property is realized by the morphomic l-participle, and not by any dedi- 
cated f-TENSE:{pst} property. I shall therefore assume a univalent FORM property, TNS, 
itself a value of VFORM, realizing c-TENSE:f[prs, fut} depending on the value of ASPECT. 
This replaces the atom-valued TENSE:{present-future} shown in Table 2 above. The prop- 
erty VOICE:{act, pass} is intriguing. It is a CONTENT property of the verb system but it 
is not a FORM property of any part of the verb system proper, outside of the participle 
subsystem. However, the participles distinguish active and passive sets, and the per- 
fective passive participle is actually part of the periphrastic exponence of the syntactic 
(CONTENT) VOICE property. Moreover, the two passive participles have the passive 
SYN|ARG-STR representation, namely, ...(E((x), y, ...), where (x) denotes the demoted 
active subject argument role. Therefore, VOICE is both part of the CONTENT and the 


406 


18 Split-morphology and lexicalist morphosyntax: The case of transpositions 


FORM paradigm, albeit with a somewhat complex realization, which will require the 
f-MORSIG attribute to be modified by a feature co-occurrence statement restricting f- 
VOICE to the participles. The other CONTENT paradigm features are represented by 
forms which are effectively periphrastic. It is for these reasons that the FORM MPSs 
have to be defined independently, as shown below. 

One attribute that is shared across FORM/CONTENT paradigms is REPRESENTA- 
TION. The only reason for a language to define a true transpositional category is to 
allow a lexeme to assume the syntactic distribution of a word of a different class, so 
REPR clearly has to be a CONTENT feature. In languages such as Russian, in which 
participles are marked morphologically, this also means that REPR is a FORM feature. 
I shall assume that the REPR attribute as applied to verbs has three values. The first is 
plain (or (V,V)), the ‘identity representation’, and the default value. Where no indication 
of REPR is given the default is to be understood. The second value of REPR is (V, A), 
defining the four participles. The third value is (V,Adv),!° defining the imperfective and 
perfective gerunds. I shall ignore the gerunds for simplicity of exposition. This means 
that I shall only mark the participles explicitly. 

We now define the content of the c-MORSIG attribute by reference to the verb’s SYN 
value. The aspects, and voice are defined with the mapping shown informally in (10). 
This will apply to any representation whose ARG-STR contains the E sf role. In practice, 
this means verbs, participles, and gerunds. 


(10) ...E...— ASPECT, VOICE c MORSIG 


I return to the c-MORSIG|CONDITIONAL, TENSE:{pst} mappings below when I discuss 
the status of the l-participle. 

The feature array defining participles is given in Fig. 1. This is the "derived" MORSIG 
attribute for any lexical representation defined by the feature REPR:(V,A). The property 
sets labelled | 1 | in Fig. 1 come from the MORSIG attribute of the base verb by virtue of 
(10). The property sets | 3 |,| 4 | are those which ensure that the participles are inflected 
like adjectives. I return to these when I have introduced adjective inflection. 

The key to my analysis of transpositions is to incorporate part of the analysis of 
derivational morphology into the definition of the transposition's entry. Given a feature 
pairing (£,p), where £ is a lexemic index and p contains a value of REPR (for instance, 
[REPR:(V,A)] for participles), the GPF applied to this pairing leaves the LI and the SEM 
representations of the base unchanged, but enriches the SYN|ARG-STR attribute by cre- 
ating a complex semantic function role (A;(E(xi, ...))). The coindexation guarantees 
that the noun head modified by the participle is identified with the highest thematic 
argument of the base verb's argument array, i.e. its SUBJECT. The FORM function com- 
ponent of the GPF defines the stem set for the participle, but underspecifies all other 
FORM information, including the MORSIG (and the CONTENT paradigm MORSIG is 
also underspecified by the SYN function). 


10 The semantic function role label Adv stands proxy for whatever the appropriate way is of defining adverbs 
as distinct from adjectives. 
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ASPECT ipfe. piv) 


[S 


VOICE fact, pass} 


NUMBER [sg pl} 
MORSIG 


CONCORD |GENDER Im. £n] 3 


CASE {nom, i] 


MORCLASS adj 


D 


Figure 1: Feature structure for Russian participles 


The lexemic entry for a typical transitive Russian verb such as UDARIT/UDAR AT is 
that shown in Fig. 2, where V stands for the verb's lexemic index. The lexemic entry's 
value for LI is just the LI of the lexeme, of course (the GPF here does not describe a 
process of derivational morphology). Note that for this lexeme the SYN attribute, too, 
is completely underspecified, lacking even the ARG-STR attribute. The value of that 
attribute is determined by default from the Event ontological type of the SEM represen- 
tation. However, the REPR feature which defines transpositions introduces a realization 
rule which has to be defined over a specified ARG-STR representation. Therefore, the 
ARG-STR attribute has to be part of the lexeme's (SYN attribute's) morpholexical sig- 
nature (inflection-like argument structure alternations such as passive or antipassive 
impose a similar requirement). In this respect, the transpositions are like inflection and 
not like derivation. 

We can informally state the realization rule which defines adjectival representations 
of lexemes (transpositions-to-adjective) as a function o from ARG-STR representations 
to ARG-STR representations, as shown in (11), where (11a) defines a participle's ARG-STR 
and (11b) defines that of a relational adjective. 


(I) a o((E(x,...))) = (Ai(E(X,...))) 
b. o((R)) = (A (RD) 


Given the lexemic entry in Fig. 2 and the realization rules for Russian morphology, 
the GPF for the imperfective active participle, udar'ajusc- will map to a partially un- 
derspecified lexical representation, as shown in Fig. 3. A representation such as this is 
the *quasi-lexeme' discussed earlier. Like a simple lexemic entry, or an entry defined 
by a derivational GPF, it needs to have its MORSIG attributes specified in order to be 
inflectable. I turn now to how those MORSIG entries are defined. 

The partially specified MORSIG we need to be able to define for participles is that 
shown in Fig. 4. The ASPECT/VOICE properties are shared with verbs. This can be 
achieved by writing the rules defining the MORSIG of verbs and participles in such as 
way as to refer either to the “outermost” E semantic function role of the ARG-STR at- 
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PHON udar 
SE MORCLASS V|CONJ2 
fform (Y,u) = 
PHON udar aj 
STEM1 
MORCLASS V|CONJI 
fonl((V, u)) = u 
fsem((V, u)) = Axy[HIT(xy)] 
ICH, u )) = u 


Figure 2: Lexemic entry for UDARIT/UDAR AT 


FORM((V, {REPR:(V,A)})) 


STEMO|PHON  udar'aju&é 
MORSIG u 


SYN((‘V, {REPR:(V,A)})) ARG-STR (A; (E(x;,y))) 


MORSIG u 
SEM((V, (REPR:(V,A)D) - identity function 
LI((V, (REPR:(V,A) D) - identity function 


Figure 3: Underspecified entry for udar‘ajusé(ij) 


ASPECT: (u) 

VOICE: (u) 
MORSIG 

CONCORD:{u} 


MORCLASS: ADJ | DECL1/2 


Figure 4: MORSIG for Russian participles 
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tribute, or the “embedded” E role found with participles." The CONCORD attribute, | 3 |, 
comes from the generic c-MORSIG of an adjective, shown in (12).! 


(12) .. A... = [CONCORD:{NUMBER, GENDER, CASE}] c MORSIG 


The [MORCLASS adj] specification, | 4 |, is strictly speaking a stipulation, except that 
in Russian (in contrast to, say, Latin) all participles belong to the default adjectival class, 
DECL1/2. 

Given this machinery we can now account for the transpositional mixed category of 
participle by application of the (quasi-inflectional) generalized paradigm function apply- 
ing to a verb and delivering its participial forms, triggered by the [REPR] feature. These 
representations will be underspecified for (adjectival) CONCORD features. For concrete- 
ness, consider the imperfective active participle, udar ajusc-. Given p = (REPR:(V,A), AS- 
PECT:ipfv [+], VOICE:act[2]}, then for V = uDARIT/UDAR AT, the GPF((V,p)) will apply 
to a lexical representation which is derived from the lexemic entry for UDARIT/UDAR AT 
whose MORSIG attributes have been fully specified, allowing the lexeme to be inflected 
(in the broad sense of this term, including “inflection” for participle formation). This 
GPF will deliver a partially underspecified lexical representation for the participle. The 
GPF will specify the participle's stem form(s), the ASPECT, VOICE features which define 
that particular participle, and the enriched ARG-STR attribute with complex semantic 
function role. 

The lexical representation which is input to the GPF is shown in Fig. 5 and the lexical 
representation of the participle is shown in (13). In Fig. 5, STEMO denotes the perfec- 
tive stem, which is effectively the lexeme's root. CONJ2 is second conjugation, and this 
means that most inflectional rules will be defined over another stem, udari-, derived 
by regular rules of stem formation. STEMI denotes the imperfective stem, a member of 
CONJI, the first conjugation, whose inflectional stem is therefore udar'aj-. Attributes 
which belong to both FORM and SYN (CONTENT) paradigms are tagged to make them 
more easily identifiable. 


(13) fform = STEM-iap 6 šč = udar’aju-8¢- 
fsyn = ARG-STR:(A;(E(x;,y))) 


where STEM-iap denotes the imperfective active participle stem, derived from STEMI. 

We have now achieved our goal of defining participles. In effect, we have defined 
the lexemic entry for a class of adjectives, whose peculiarity is that they are marked for 
verbal voice and aspect and they share their semantics and lexemic index with a parent 
verb. 

It is instructive to compare the behaviour of the true participles with that of the 1- 
participle. While the true participles are mixed categories, the l-participle is essentially 
a verb form with unusual agreement morphology. It is not entirely clear how best to 


H Russian action nominals are transpositional lexemes and not true transpositions. 
12 Members of the semantically defined class of quality or scalar adjectives will also have the feature COM- 
PARISON added to their MORSIG to define comparative/superlative morphology. 
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STEMO0 PHON | udar 
MORCLASS V | CONJ2 


PHON 
STEM1 


MORCLASS V | CONJ1 


ASPECT: ipfv, pfv 

VOICE:act, pass 
TNS 

VFORM:| LPTCP 


Di 


FORM 


MORSIG 


Le 


CONCORD {N UMBER, PERSON, GENDER} 


" 


REPR{ (V.A), (V.Adv)} 
Other verb FORM MPSs 


ES 


ARG-STR: (E(xiy)) 


ASPECT] ipf, pfo) 
vorce{ act, pass} 


a 


Di 


TENSE pst, prs, fut} 
SYN MORSIG 


Le 


CONCORD{ NUMBER, PERSON, GENDER] 


D 


REPR: (VA), (V.Adv)} 


ARG-STR:(sf role (0 array)) 
Other verb CONTENT MPSs 


a 


ES 


SEM AXY Levent HIT(x,y)] 
LI UDARIT /UDAR AT 


Figure 5: Lexical representation of the lexeme UDAR’IT’/UDAR’AT’ ‘to hit’ 
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account for the peculiar agreement properties of the l-participle in the past tense and 
conditional constructions, but the simplest (if somewhat crude) way to do this is to as- 
sume that verbs in general agree with their subjects in person, number, gender features, 
but that in the past/conditional forms agreement is restricted to number, gender, while 
in the present tense forms it is restricted to person, number.” 

I conclude, then, that the l-participle is a verb form that inflects just like a (subtype 
of) predicative (short-form) adjective. Assuming a feature ADJDECL covering adjectival 
morphology generally, we can distinguish several sub-types of declension, including 
that for the predicative adjectives, [ADJDECL:predadj]. The l-participles will belong to 
a subtype of this class, [ADJDECL:predadj:lptcpdecl]. The ADJDECL feature is part of 
the MORSIG attribute of ordinary adjectives, as defined by the paradigm linkage rule in 
(14).4 


(14) ..A...2 ADJDECL c MORSIG 


We also compare true participles with qualitative adjectives derived by conversion 
from participles. These are a type of transpositional lexeme. The theoretical significance 
of this type of lexical relatedness for current morphological models was first identified, as 
far as I am aware in Spencer (2013: 275), where I discuss English words such as PREPOSI- 
TIONAL, from PREPOSITION. These look like relational adjectives (noun-to-adjective trans- 
positions), because their lexical semantic content seems to be identical to that of their 
base noun (prepositional phrase means the same as the compound preposition phrase, 
for instance). However, the English adjectives are syntactically opaque: monosyllabic 
prepositional phrase does not mean ‘phrase headed by a monosyllabic preposition’ but 
only ‘monosyllabic phrase headed by a preposition’ (though that interpretation is pos- 
sible for the compound, monosyllabic preposition phrase). In other words, the adjectival 
expression only has the structure monosyllabic [prepositional phrase], not [monosyllabic 
preposition]al phrase. 

English has a large number of qualitative adjectives derived by conversion from par- 
ticiples, which are also instances of transpositional lexemes (Spencer 2016a): amazed/ 
amazing, bored/boring, challenged/challenging, interested/interesting, .... In very many 
cases it is not possible to identify a meaning difference between the adjective and the 
etymological verb base: This book bores me/This book is boring. Russian participles like- 
wise are often converted into qualitative adjectives: potr'asajustij ‘amazing’, vyzyvajuscij 
‘provocative, defiant, challenging’. In Russian it is often possible to determine that a word 
with the shape of a participle is actually an independent adjective, since only true ad- 
jectives have the short predicative form: uspexi potr'asajusci (plural, from potr asajusc) 


13 A more sophisticated approach might define agreement in a morphology-driven fashion by stating that 
the agreement features that the syntax can manipulate are restricted to those that can be expressed by a 
particular morphological form, so that it is the morphological MORSIG that determines which features 
trigger agreement, even in identical syntactic positions. 

14 As adjectives, we might expect the participles to have predicative forms, too. This is true, however, only for 
the passive participles, especially the perfective passive, which has a special stem form ending in a singleton 
/n/, nap‘isan ‘written’, in contrast to the attributive form with geminated /nn/: (v speske) nap'isannaja 
(zapiska) ‘(a hastily) written (note)’. 
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‘the-progress is-amazing’ (lit. ‘the-successes are-amazing’), ego poveden Ze vyzyvajusce 
‘his behaviour is-defiant’. True participles do not have the predicative form (Russkaja 
Grammatika II, p.666). English converted participles fail to inherit the complementation 
properties of the base verb: The obstacle course challenged the stamina of the athletes~ The 
obstacle course was very challenging (*the stamina of the athletes). Russian transpositional 
adjective lexemes behave likewise. The transitive base verb Borg ASAT’ ‘to amaze’ gives 
us uspexi potrasali nas ‘the-successes amazed us’, but the (active) transpositional lex- 
eme cannot take a direct object: "uspexi potr'asajusci nas. The true participle remains 
transitive: potr'asajuscie nas uspexi ‘progress (successes) which amaze us’. 

We thus have a double dissociation of properties: on the one hand, we have the l- 
participle which has the form of a predicative adjective but which realizes finite (tense/ 
mood) verb properties and retains the full complementation properties of the verb, and 
on the other hand we have participle-like adjectives which, while allowing predicative 
adjective forms, lack all the crucial morphosyntax of verbs. In between we have the true 
participles, with the external morphosyntax of an adjective but the complementation 
properties of the base verb. 


6 Conclusions: Transpositions and split morphology 


I have argued in this paper for a view of morphosyntax which recognizes a word/phrase 
distinction and, given that, a distinction between (abstract) lexemes and (concrete) in- 
flected word forms of those lexemes, valid for very nearly all known languages. I have 
also assumed that languages can increase their stock of lexemes by means of deriva- 
tional morphology, and that in some cases this is sufficiently regular to be regarded as 
paradigmatic, hence, part of the grammatical system proper. The inflection/derivation 
distinction is controversial, however, because it is often difficult to know where the 
boundary actually lies and where to place intermediate types of lexical relatedness. The 
transpositions, as exemplified by the Russian participial system discussed here, represent 
a particularly troublesome case-in-point. 

Participles and other transpositions are often treated as a type of derivational morph- 
ology, because they involve a shift in word class, but this is a wrong characterization. 
Participles are part of a verb's paradigm, they are not a type of lexical stock expansion 
(Beard 1995). Nonetheless, participles inflect like adjectives, not like verbs and thus seem 
to straddle the inflection/derivation divide in a way that calls that very distinction into 
question, and, on the face of it, even provides support for models in which all morphol- 
ogy is just syntax by other means (Minimalism/Distributed Morphology) or in which all 
syntax is just morphology by other means (American morphemics). Here I have claimed 
that, on the contrary, we can only make sense of participles against a set of background 
assumptions that contradict monolithic models in which there is no autonomous morph- 
ology module (Aronoff's *morphology-by-itself"), so that morphology is no more than 
syntax by other means. The crucial observation is that derivational morphology induces 
a type of lexical opacity which is lacking in transpositions, which, by contrast, show the 
kind of lexical transparency associated with inflected forms. 
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In the GPFM model, canonical derivation is a relation between maximally underspeci- 
fied (minimally redundant) lexical representations (lexemic entries), consisting of a spec- 
ification of the basic root form (FORM|PHON) and the ontological/semantic representa- 
tion (SEM). The morphosyntactic properties are then projected from these by default 
mappings. Canonically, derivation enriches the PHON and the SEM representations, 
and the Default Cascade then specifies the morphosyntactic properties of the derived 
lexeme. One consequence is that there will then be no “trace” left of the morphosyntactic 
properties of the base lexeme, such as its word class or its argument structure. This au- 
tomatically guarantees most of the lexical opacity/lexical integrity effects familiar from 
the literature. In some (noncanonical) cases it may be necessary to stipluate overrides 
of lexical information as part of the derivational Generalized Paradigm Function. This 
might be true if, for instance, a base lexeme belongs to a lexical category which is not the 
default for its ontological class. For instance, a language with a distinct lexical category 
of (qualitative) adjective may also have non-derived stative verbs whose denotations are 
of the ontological type Property, and which by the Default Cascade should be adjectives, 
not verbs. Such a verb would have to have its ARG-STR|(E ...) value prespecified. If such 
a lexeme were the input to a verb-to-noun derivational function (nominalization), then 
that ARG-STR would have to be overridden by the nominalization function, replacing it 
wholesale with the ARG-STR|(R) value. Exactly how such cases are to be handled has 
to remain a matter for future research. 

No such opacity is found with canonical inflection: in general, the syntax treats a 
noun as a noun no matter what its number, case, definiteness, ... value. Thus, although a 
locative case marked noun would normally be restricted to functioning as an adjunct or 
the complement of a class of adpositions, it can still be modified by an adjective, just like 
any other noun form, so that for a noun ‘house’, new house-Loc means ‘at a new house’. 
In this respect, a locative case marked noun typically differs from a derived denomi- 
nal lexeme denoting a location. Many languages have a denominal derivational marker 
meaning ‘place where there is/are NOUN, place associated with NOUN’: N-PLACE. Typ- 
ically, the base noun, N, is not accessible to morphosyntax, so that an expression such 
as new house-PLACE could only mean ‘new place where there is a house/are houses’ and 
not ‘place where there is/are a new house(s)’. Thus, inflection differs from derivation in 
being lexically transparent. 

The significance of these rather obvious points about inflected forms is that we are 
far less able to make similarly categorical claims where transpositions are concerned. A 
participle behaves in the syntax to some extent as though it were an inflected verb form, 
but not entirely. In GPFM the lexical representation of a transposition exhibits trans- 
parency by virtue of being a member of the base lexeme’s (extended) paradigm, that is, 
by bearing the same LI as the base. In this respect it is not an autonomous lexeme. On the 
other hand, the transposition exhibits the external syntax of a distinct (derived) lexical 
category. The extension to the GPF proposed here permits us to model this “inflectional- 
paradigm-within-a-paradigm” effect in a way that reflects the lexical transparency of 
the transposition while also allowing us to state restrictions on full transparency as a 
restriction on the MORSIG of the transposition. 
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A crucial aspect of the analysis is the distinction between FORM/CONTENT proper- 
ties (m-/s-features). Without at least this level of differentiation we cannot make sense 
of Russian participles and we cannot distinguish them from the verbal l-participle form 
or the transpositional lexemes. A second crucial aspect of the analysis of any type of 
transposition is the Lexemic Index (LI). In the general case, there is no combination of 
lexical properties that will uniquely serve to individuate lexemes, but it is nonetheless 
essential to impose such an individuation to account for the patterns of lexical related- 
ness that are found across languages, specifically, to distinguish true participles from 
departicipial converted adjectives (transpositional lexemes). 

The combination ofthe FORM/CONTENT paradigm distinction (or its equivalent) and 
the Lexemic Index allow us to define not just canonical inflection but also non-canonical 
intra-lexemic types of relatedness such as that shown by transpositions. That combi- 
nation also serves to reconstruct the split in the morphology argued for originally by 
Anderson. Indeed, split morphology is entailed by this set of assumptions, except that 
in a constraints-based model the split is defined in terms of access: inflectional morph- 
ology is that which permits syntax to retain access to the properties of the base lexeme 
(lexical transparency), while derivational morphology permits no such access (lexical 
opacity/integrity). 

The more articulated view of morphosyntax proposed here allows us to pose a ques- 
tion which was not at the forefront of debate over the question of split morphology, as 
far as I am aware: which side of the split do transpositions fall on? The answer, given the 
foregoing, is "both". The transposition is inflectional by virtue of preserving the base's 
LI and by virtue of the, at least partial, transparency of the base's properties. It is deriva- 
tional by virtue of the fact that it defines the paradigm of a quasi-lexeme, within the 
paradigm of the base. But it would be difficult to make sense of this conclusion without 
assuming the basic split in the first place. 


Appendix: Illustration of the lexical representations 
assumed 


To specify, say, the 3sg form drives, the GPF((DRIVE,{3sg})) applies to the output of Figure 
6 to specify the FORM value |STEM0ez|, leaving other aspects of the representation 
unchanged. This is equivalent to the operation of the paradigm function in PFM1 and 
the output of the Corr function in PFM2. 
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FORM  STEMO | drarv | 


SYN — 
— 
SEM [Event AX, y.drive(x.y)] 
LI DRIVE 
STEMO | drarv | 
TNS {prs, pst| 
FORM 
MORSIG |VFORM {ing-form, ed-form, basel 
AGR Ion, none] 
SYNCAT V 
ARG-ST  (E(xy)) 
TNS {prs, pst| 
SYN 


MORSIG ASP {simple, prog, pert} 


AGR lose. none] 


SEM [Event Ax; y.drive(x,y)] 
LI DRIVE 


Figure 6: Illustration of the application of the GPF for standard inflection. 
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(15) 


DRIVE = DRIVER 


Where ô = SubjNom, È = [Event Ax; y.drive(x.y)], GPF((DRIVE,5)) = 


STEMO(pRIVE)@ | ə | 


FORM  STEMO |drarv | 

SYN — 

SEM È 

LI DRIVE 

FORM [STEMO 

SYN — 

SEM [thing AX.PERSON(x) A X 
LI S(DRIVE) 


Figure 7: Illustration of the application of the GPF for standard derivation 


The output of this GPF then undergoes specification of MORSIG by the Default Cas- 
cade, which defines the derived lexeme as a syntactic and morphological noun. 


Abbreviations 
AGRSUBJ subject agreement 
act active 
ARG-STR argument 
structure 
CONJ conjugation 
DECL declension 
GPF Generalized 
Paradigm 
Function 
GPFM Generalized 
Paradigm 
Function 
Morphology 
ipfv imperfective 
LI lexemic index 


L-PTCP, I-ptcp 


l-participle 


MORCAT 


MORCLASS 
MORSIG 


MPS 


pass 
PF 
PFM 


pfv 
PHON 
REPR 
SEM 
SYN 


morpholexical 
category 
morphological class 
morpholexical 
signature 
morphosyntactic 
property set 
passive 

Paradigm Function 
Paradigm Function 
Morphology 
perfective 
phonological form 
REPRESENTATION 
semantics 

syntax 
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In a series of publications, Stephen Anderson developed the idea that the definition of a lan- 
guage’s inflectional morphology involves blocks of realization rules such that (i) realization 
rules’ order of application follows from the ordering of the blocks to which they belong and 
(ii) realization rules belonging to the same block stand in a relation of paradigmatic opposi- 
tion. A question that naturally arises from this conception of rule interaction is whether it is 
possible for the same rule to figure in the application of more than one block. I discuss two 
systems of verb inflection exploiting exactly this possibility - those of Limbu and Southern 
Sotho. In order to account for the special properties of such systems, I argue that in the def- 
inition of a language’s inflectional morphology, one rule may be dependent upon another, 
and that in such cases, the dependent rule may figure in the application of more than one 
block precisely because the “carrier” rules on which it is dependent differ in their block 
membership. In formal terms, this means that the definition of a language’s inflectional 
morphology may draw upon principles of rule conflation by which a dependent realization 
rule combines with its carrier rule to form a single, more complex rule, typically occupying 
the same block as the carrier rule. I further show that there is considerable independent 
motivation for the postulation of these principles. 


1 Introduction 


In a series! of articles culminating in his 1992 monograph A-morphous Morphology, Ste- 
phen Anderson developed a model for the precise inferential-realizational definition of 
complex inflectional patterns.’ 

Two central principles of this model are (1) and (2). According to (1), the definition of 
the Latin word form laudà-ba-nt-ur ‘they were being praised’ involves the realization 
of a morphosyntactic property set through the interaction of ordered rule blocks. One 
of these houses a rule realizing the imperfect indicative through the suffixation of -ba; 


1 Key references include Anderson 1977; 1982; 1984a,b; 1986. 

? [n the typology of morphological theories proposed by Stump (2001), a theory is inferential if it employs 
rules to infer the form of a language's words and stems from that of less complex stems; a theory is real- 
izational if its definition of a language's morphology takes a word's content as logically antecedent to its 
form. 


Gregory Stump. 2017b. Rules and blocks. In Claire Bowern, Laurence Horn & Raf- 
| faella Zanuttini (eds.), On looking into words (and beyond), 421-440. Berlin: Language 
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this is followed by a block housing a rule realizing third-person plural subject agreement 
by the suffixation of -nt; this, in turn, is followed by a block containing a rule realizing 
passive voice through the suffixation of -ur. The successive application of these rule 
blocks infers the word form laudàábantur from the stem lauda- as the realization of the 
property set {3 pl imperf ind pass} in the paradigm of the lexeme LAUDARE ‘praise’. 


(1) A language’s inflectional rules are organized into ordered blocks such that two 
rules’ order of application depends on the ordering of the blocks to which they 
belong. 


(2) Rules belonging to the same block are disjunctive: at most one rule per block 
applies in the realization of a given word form. In general, competition between 
rules is resolved in favor of the rule with the narrower domain of application. 


According to (2), the definition of a word form by a sequence of rule blocks involves 
the application of at most one rule per block. Two rules belonging to the same block may 
be defined so as to apply in disjoint contexts; for instance, the rule realizing third-person 
plural subject agreement through the suffixation of -nt realizes different property sets 
from the rule realizing first-person plural subject agreement throught the suffixation of 
-mus. But it can also happen that two rules belonging to the same block are both in 
principle applicable in the same context; for instance, -i and -ø both realize first-person 
singlar subject agreement and might therefore be seen as entering into competition in 
the realization of certain forms. Given that the -i rule applies only in the first-person 
singular perfect indicative active (e.g. laudàvi ‘I have praised’), its domain of application 
is narrower than that of the -o rule, which apparently applies as a default, surfacing 
in the present and future indicative active (laudo ‘I praise’, laudabo ‘I will praise’) and 
passive (laudor ‘I am praised’, laudabor ‘I will be praised’) as well as in the future perfect 
indicative active (laudavero ‘I will have praised’); accordingly, the -i rule overrides the 
-6 rule in the realization of the first person singular perfect indicative active. 

Anderson's model has afforded the most plausible existing accounts of a diverse range 
of inflectional systems (see, for example, the analyses of Potawatomi and Georgian in 
Anderson 1977; 1984a; 1986 and that of German in Zwicky 1985), and it continues to raise 
important theoretical questions. One such question is whether the same rule? may figure 
in the application of more than one rule block. I argue here that in a particular class of 
cases, this is precisely what happens. 

In the cases in question, there is always a relation of dependency among particular 
rules. Harris (2017) describes relations of this sort in affixal terms as involving a depen- 
dent affix that only appears in the presence of an available carrier affix. Adopting and 
extending her terminology, I describe such relations as involving a dependent rule that 
only applies in combination with an available carrier rule. As I show, a dependent rule 
may figure in the application of more than one block ifthe rules on which it is dependent 
differ in their block membership. Instances of this sort are of two kinds. 


3 Two rule applications are seen as involving the “same rule" if they realize the same morphosyntactic con- 
tent by means of the same exponent even if they introduce that exponent into different positions. 
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First, there are instances of multiple exponence in which the same rule of affixation 
apparently applies in more than one block in a word form’s inflectional realization; an 
example from the Limbu language [Kiranti; Nepal] is the multiple exponence of certain 
agent concord properties in the inflection of transitive verbs. Second, there are instances 
of polyfunctionality involving a rule of affixation whose function varies systematically 
according to the block in which it applies; an example is the polyfunctionality of concor- 
dial affixes in the verbal inflection of the Bantu languages. In order to account for cases 
of these two sorts, it is desirable to supplement (1) and (2) with principles (3) and (4). 


(3) A dependent rule may be conflated with a carrier rule to produce a more 
complex rule. Where a dependent rule R; realizes property set o by means of 
exponent x and its carrier rule R? realizes property set t by means of exponent y, 
the conflation of R; with R3 is intuitively a rule realizing the property set o Ut 
by means of the combined exponents x and y. 


(4) A rule block may contain both simple and conflated rules. 


As I shall show, these principles afford economical accounts of multiple exponence 
in Limbu verbs and of polyfunctional verbal concord markers in Southern Sotho [Bantu; 
Lesotho]. After describing the expression of agent properties in Limbu verbs (82) and 
the Southern Sotho system of verbal concord (83), I propose a formal framework for 
rule conflation in inflectional morphology (84). I then present explicit theories of the 
observed pattern of multiple exponence in Limbu ($5) and that of polyfunctional concor- 
dial morphology in Southern Sotho ($6). I conclude with some observations about the 
wider importance of rule conflation in an adequate theory of morphology (87). 


2 Multiple exponence in the expression of agent inflection 
in Limbu verbs 


In Limbu, two agent concord suffixes participate in relations of multiple exponence: 
-ņ, an expression of first-person singular agent concord, and -m, an expression of non- 
third-person plural agent concord. Table 1 exemplifies the distribution of these suffixes 
in positive forms of HU?Ma? ‘teach’. (In this table, parenthesized segments are superfi- 
cially elided in prevocalic position by an ordinary phonological process.) Both suffixes 


^ The structure of Table 1 should be carefully noted. Each row in the table is occupied by a different word 
form in the paradigm of the Limbu verb nu?MA? ‘teach’. Each word is in exploded form, with its parts 
arranged in columns corresponding to the affix position classes postulated by van Driem. (I follow him in 
labeling these classes pf1 and sfi-sf10.) Thus, the word form in the 1s — 3ns row of the nonpreterite part 
of the table is hu?r-u-n-si-n ‘I teach them’. This table does not comprise the complete paradigm of HU?MA? 
‘teach’, but encompasses those forms that involve the agent suffixes -7 and -m (as well as a few other 
pertinent forms in which the appearance of these suffixes is overridden). The claim that these suffixes 
appear in two different positions means that they appear in two different columns, since each column 
defines an affix position class in the traditional sense. 
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appear in two different affix positions; van Driem (1987) labels these positions sf5 and 


sf9.° 


Table 1: The agent suffixes -y and -m in positive forms of the Limbu verb 
HU?MA? ‘teach’ 


agent pfi sf 
— patient a b stem! 1 2 4 5 7 8 9 10 
1s — 2s hu? ne 
1s — 2d hu? ne ci n 
& 1s— 2p hu? n(e) i 0 
9 1s— 3s hurr u 
S 1s — 3ns hurr u n si n 
S8 lIpi—3s a hu?r u m 
Z lpi —> 3ns a hu?r u m si m 
lpe — 2 hu? ne ci ge 
lpe — 3s hurr u m be* 
lpe — 3ns hu?r u m si m Gei 
2—1 a gei hu? 
2p — 3s ke — bur u m 
2p — 3ns ke ` bur u m si m 
1s — 2s hu? n(c) e 
1s — 2d hu? n(c) e ci n 
o 1s—2p hu? n(c) (e) i n 
€^ 1s 3s hu?r () u mg 
9 1s— 3ns hurr e€) u m si n 
Se lpi — 3s a hu?r (el u m 
l]pi—3ns a hufr (el u m si m 
lpe — 2 hu? n(c) e ci ge 
lpe — 3s hu? mina 
lpe — 3ns hu? mina si 
2—1 a g&  hufr € 
2p — 3s ke ` bur u m 
2p — 3ns ke ` bur u m si m 


1. bur is a prevocalic alternant of hu?. 

2. ge is an alternant of ke (van Driem 1987: 2) 

3. s becomes c after e (van Driem 1987: 77) 

4. beis a phonologically conditioned alternant of ge (van Driem 1987: 102) 


5 Affix positions sf3 and sf6 are missing from Table 1 because the affixes that appear in these positions don't 
occur in forms having a first-person singular agent or a nonthird-person plural agent. 


424 


19 Rules and blocks 


The distribution of these suffixes is, in fact, doubly puzzling. Besides participating in 
relations of multiple exponence, they also exhibit gaps in their distribution. Consider 
first the suffix -7. Because ten of the forms in Table 1 realize first-person singular agent 
properties, all ten would be compatible with the appearance of the -7 suffix in both the 
sf5 and sf9 positions. Yet, only two of the forms exhibit -7 in both positions; two exhibit 
it only in position sf5; four, only in postion sf9; and two lack -7 altogether. Consider like- 
wise the distribution of the nonthird-person plural agent suffix -m. Among the fourteen 
forms in Table 1 that realize nonthird-person plural agent properties, only five exhibit 
-m in both the sf5 and sf9 positions; five have it in the sf5 position only; and four lack 
-m altogether. 

A cursory examination reveals the distributional generalization accounting for these 
results: -7 and -m appear in position sf5 only if there is an overt affix in position sf4, 
and they appear in sf9 only if there is an overt affix in sf8. In other words, the rules 
that introduce -7 and -m in Limbu are dependent rules whose application presumes that 
of a carrier rule filling position sf4 or sf8. Because there are carrier rules in more than 
one rule block, the -7 and -m rules may both figure in the application of more than one 


block. 


3 Polyfunctional concordial morphology in the verb 
inflection of Southern Sotho 


Typically of Bantu languages, Southern Sotho has a rich noun-class system one of whose 
manifestations is the inflection of verbs for the noun class of their subject and object ar- 
guments. In the analysis proposed by Doke & Mofokeng (1985), this system exhibits 
seven noun classes; these have the effect of subclassifying the third person, so that like 
the first and second persons, each noun class subsumes both singular and plural forms. 
Table 2 presents the inventory of prefixes by which verbs inflect for the person, num- 
ber and noun class of their subject. By a similar inventory of prefixes, transitive verbs 
may? inflect for the properties of their object; the examples in Table 3 illustrate. Table 4 
presents the inventories of subject-coding and object-coding prefixes side by side; as this 
table shows, the two inventories are nearly identical; the only exceptions are in the sin- 
gular of the first person and of class 1, where the exponents of subject properties differ 
from those of the corresponding object properties. 

The principal difference between the two inventories is morphotactic: subject-coding 
prefixes occupy the position before that of tense prefixes (such as the future-tense prefix 
tla- in Tables 2 and 3) while object-coding prefixes occupy the position following that 
of tense prefixes. Thus, the general pattern is that the prefixes in Table 4 express proper- 
ties of person, number and noun class, and that it is a prefix’s position that determines 
whether the properties that it expresses are subject or object properties. Put another 


$ Unlike the subject concords, whose use is obligatory in finite forms, the object concords are optional, 
generally being use to express a pronominal object rather than to express agreement with an overt object 
phrase (Doke & Mofokeng 1985: 242). 
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Table 2: Future-tense forms of Southern Sotho Bona ‘see’: ‘I/ you / etc. will 
see’ (Doke & Mofokeng 1985: 207ff.) 


Subject Subject class Subject number 
person Doke & Meinhof Singular Plural 
Mofokeng sg pl 


1 ke-tla-bóna _ re-tla-bóna 
2 u-tla-bona  le-tla-bóna 
3 1 1 2 0-tla-bóna ba-tla-bóna 
2 3 4 0-tla-bóna &-tla-bóna 
3 5 6/10 le-tla-bóna ` a-tla-bóna, li-tla-bona 
4 7 8 sé-tla-bona  li-tla-bóna 
5 9 10 é-tla-bona li-tla-bona 
6 14 6 bo-tla-bóna a-tla-bòna 
7 15/17 ho-tla-bona 


Table 3: Future-tense forms of Southern Sotho BONA ‘see’: ‘they will see me / 
you / etc’ (Doke & Mofokeng 1985: 242ff) 


Object Object class Object number 
person Doke & Meinhof Singular Plural 
Mofokeng sg pl 
ba-tla-m-póna ba-tla-re-bóna 
2 ba-tla-u-bóna ba-tla-le-bóna 
3 1 1 2 ba-tla-mo-bóna ` ba-tla-ba-bóna 
2 3 4 ba-tla-o-bóna ba-tla-e-bóna 
3 5 6/10 ba-tla-le-bóna ba-tla-a-bona, ba-tla-li-bona 
4 7 8 ba-tla-sé-bona ba-tla-li-bona 
5 9 10 ba-tla-e-bóna ba-tla-li-bóna 
6 14 6 ba-tla-bo-bóna ba-tla-a-bóna 
yi 15/17 ba-tla-ho-bóna 


way, the rules introducing the noun-class concords in Table 4 generally figure in the 
application of more than one rule block, expressing subject properties in one block and 
object properties in another. 


4 Rule conflation 


It is clear from the foregoing evidence that in the definition of a language's inflectional 
morphology, the same realization rule may figure in the application of more than one 
rule block. I propose that this is an effect of the phenomenon of rule conflation; in 
particular, I propose that when rule R figures in the application of both Blocks A and B, 
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Table 4: Indicative verbal concords in Southern Sotho (Doke & Mofokeng 1985: 


197,243) 
Class Subject Object 
Person Doke& Meinhof sg pl sg pl 
Mofokeng sg pl 
1 ke- re N-* re 
2 u- lē- u- lē- 
3 1 1 2 0- ba- mõ- ba- 
2 3 4 o E 0- e- 
3 5 6/10 le- a-li- le- a-, li- 
4 7 8 sē- li- sē- li- 
5 9 10 e li- ē- li- 
6 14 6 bo- a- bo- a- 
7 15/17 hō- ho- 


*N represents a homorganic nasal 


it is because R may conflate both with certain Block A rules and with certain Block B 
rules. I represent the conflation of R; with R? as [R; © R2]. 
I make six essential assumptions about the definition of rule conflation. 


4.1 Rule-block membership 


A conflated rule [R; © R2] belongs to the same rule block as its carrier rule Ro. 


4.2 Forms defined by conflated rules 


Where R; is a rule that affixes a by means of operation F and R; is a rule that affixes b 
by means of operation G, the conflated rule [R; © R2] affixes b’ by means of operation 
G, where b’ is the result of affixing a to b by means of operation E According to this 
definition, there are four logically possible patterns of conflation for rules of affixation; 
these are represented schematically in part (A) of Table 5. The conflation of Ry with R3 is 
analogous to function composition when R; and R? both effect prefixation or when both 
effect suffixation. But when R; is prefixational and R» is suffixational, the application 
of [R; © Rz] to stem X is Xab rather than aXb; and when R; is suffixational and R, is 
prefixational, the application of [R; © R2] to stem X is baX rather than bXa. In these 
latter cases, the conflation of R; with R; cannot be likened to the mathematical notion 
of function composition. 
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Table 5: Six logical possibilities for the conflation [Rj © R2] of a dependent rule 
R; with a carrier rule R2 


Dependent Carrier Conflated [Ri © R5] applied 
rule R; rule R3 rule [R; © R3] to stem X 
(A) a-prefixation b-prefixation ab-prefixation abX 
a-prefixation b-suffixation ab-suffixation Xab 
a-suffixation b-prefixation ba-prefixation baX 
a-suffixation b-suffixation ba-suffixation Xba 
(B) a-prefixation identity function ` a-prefixation ax 
a-suffixation identity function ` a-suffixation Xa 


4.3 A conflated rule's direction of affixation 


Whether [R; © R2] is a rule of prefixation or suffixation is uniquely determined by the 
properties of Rı and R2. If R? is a rule of affixation, then the direction of affixation of 
[R; © Rg] is that of Ro, as indicated in (ii) above; but if Rz is a rule of significative absence," 
then the direction of affixation of [R; © R2] is that of Ri, as in part (B) of Table 5. 


4.4 Content realized by a conflated rule 


If rule R; realizes the morphosyntactic property set œ and rule R realizes the property 
set p, then rule [R; © R2] realizes the combination of o and f. In the simplest cases, the 
relevant mode of combination can simply be seen as set union: o U f. But in the general 
case, it is preferable to regard the mode of set combination as unification;? for instance, 
the combination of {fut, {sbj 3 sg}} with {{sbj fem}} should be the unification {fut {sbj 3 
sg fem}} rather than the union {fut, {sbj 3 sg}, {sbj fem}}. That is, if Ry realizes o and R32 
realizes p, then [R; © R2] realizes the unification a LI p. 


7 A rule of significative absence realizes a particular property set by means of an identity function. In a 
realizational theory of morphology, a rule of significative absence realizing a property set o overrides the 
overt morphology of a competing rule realizing some property set of which o is an extension. (Cf. the 
analysis of Bulgarian verb inflection proposed by Stump 2001: 441ff). 

8 The assumed definition of unification is as in (i); this definition depends on the assumed definition of 
extension in (ii). (Cf. Gazdar et al. 1985: 27; Stump 2001: 41.) 


(i) The unification of p and o [i.e. p L c ] is the smallest well-formed extension of both p and o . 
Example: {{sbj 3 sg}, {obj pl} u (prs, (obj 1} = {{sbj 3 sg}, prs, {obj 1 pl} 


(ii) Given two sets o, t: o is an extension of x [i.e. t E o ] iff for each property x € q, 


either (i) x is simple property and x € o 
or (ii) x is a complex property (= a set of properties) such that y € c and y is an extension of x. 
Examples: {pl} E (1 pl} 
{prs {obj 1j E {prs {obj 1 pl} 


428 


19 Rules and blocks 


4.5 Recursion 


The definition of rule conflation does not exclude the possibility that a conflated rule 
might itself enter into the conflation of a still more complex rule; that is, rule conflation 
may be recursive. 


4.6 Nonconcatenative rules and conflation 


A priori, there is no reason why the morphological rules that enter into such conflations 
must necessarily be rules of affixation or of significative absence. The most convinc- 
ing cases, however, do involve rules of these two sorts, and I shall focus exclusively on 
such cases here. Nevertheless, nothing that I say here should be seen as excluding the 
possibility that nonconcatenative rules might also enter into relations of rule conflation. 

Rule conflation is an operation on rules rather than on affixes; nevertheless, if R; and 
Rz are rules introducing the respective affixes a and b, one can, as a kind of shorthand, 
refer to the affix ab (or ba) introduced by the conflated rule [R; € Rz] as a CONFLATED 
AFFIX. 

As I now show, this conception of rule conflation affords a straightforward account 
of multiple exponence in the expression of Limbu agent inflection (85) and of polyfunc- 
tional concordial morphology in Southern Sotho (86). 


5 Rule conflation and multiple exponence in Limbu 


Consider again the inflection of Limbu verbs for the properties of their agent argument. 
As was seen in 82, the suffix -p (expressing first-person singular agent properties) and 
the suffix -m (expressing nonthird-person plural agent properties) may appear in either 
of two positions—or in both—in a verb form's inflectional morphotactics; but their ap- 
pearance in either position is dependent on that of a suffix in the immediately preceding 
position. The following analysis of this distributional pattern is based on two key as- 
sumptions:? 


e the agent-coding suffixes -7 and -m are introduced by dependent rules that only 
apply in conflation with another, “carrier” rule, and 


e carrier rules for the -7 and -m rules exist in more than one block. 


This analysis employs independent realization rules that introduce the suffixes! in 
Table 1; these are organized into several rule blocks, each of which fills a particular affix 
position. These independent rules and their block membership are given in Table 6. There 
are also dependent realization rules; these introduce the agent-coding suffixes -n and -m, 


? The dependent rules at issue in the proposed analyses of Limbu and Southern Sotho are only manifested 
in conflation with a carrier rule. But one can also imagine that a rule might be able to function both as a 
dependent rule and as an independent rule; the rules introducing the Swahili relative affixes are argued to 
have this status in the analysis proposed by Stump (To appear). 

10 Concerning the person prefixes a- and ke- in Table 1, see van van Driem 1987: 77ff. 
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as in (5). Rule conflation is defined by the conflation rules in (6). Rule (6a) conflates the 
dependent rules with the three carrier rules identified in Table 6: 4-a, 8-a and 8-b. The 
resulting conflated rules are listed (redundantly) in Table 7. 

Because a conflated rule belongs to the same rule block as the carrier rule on which it is 
based, the conflated rule and the carrier rule compete to realize certain morphosyntactic 
property sets; in any such instance of competition, the conflated rule prevails by virtue 
of the fact that its domain of application is smaller than that of the carrier rule.” 


Table 6: Some independent realization rules of Limbu verb inflection 


Block Rule Realization rules Carrier 

label Properties realized Operation rule? 

sf1 La {{agt 1} {pat 2} X — Xne 

sf2 2-a {pret} X Xe 29 

sf4 4-a {fpat3} X — Xu yes 

sf7 7-a  {fagt1 ns} {pat 2} X  Xci no 
Sa {fpat ns} X  Xsi 

SAC op Wee -3 incl pl} XS MI yes 

sfl0 ` 10-a Iesch) X — Xge no 


sf11 11-a  ((agtiplexcl]pretípat3] X — Xm?na  yes,for 8-a 


(5) Dependent realization rules 
D. {agti sg} : X Xy (van van Driem 1987: 99) 
M. l[fagt-3pl) :X Xm (van van Driem 1987: 99f) 


(6) Conflation rules 


a. Where R is a rule in Block æ (a c (4, 8}, 
[N © R], [M © R] € Block a. 


b. [8-a © 11-a] € Block 11.? 


i Concerning each rule in Table 6, see van Driem (1987): 1-a, pp.88f; 2-a, pp.89ff; 4-a, p.82; 7-a, p.100; 8-a, 
pp.101f; 8-b, pp.95f; 10-a, pp.102f; 11-a, 100f. 

12 Conflation rule (6b) helps to resolve a conundrum in Table 1. Notice first that the suffix -m?na introduced by 
rule 11-a as an exponent of the property set {{agt 1 pl excl} pret {pat 3}} only combines with one other suffix, 
namely the suffix -si introduced by rule 8-a as an exponent of {{pat ns}}; yet, it is featurally compatible with 
the suffixes introduced by 4-a and 10-a. Moreover, the suffix -si in the form hu?-m?na-si ‘we (excl) taught 
them’ does not carry -m, even though (a) it is a carrier elsewhere and (b) -m would be featurally appropriate 
for this word form. I therefore depart from van Driem in postulating Block 11 as a portmanteau rule block 
(Stump 2001: 141) that is paradigmatically opposed to and defaults to the sequence of other suffixal blocks. 
It houses exactly two rules: the simple rule 11-a (which suffixes -mZna) and the conflated rule [8-a © 11-a] 
(which suffixes -m?na-si). Because Block 11 is paradigmatically opposed to the sequence of rule blocks 
to which Block 8 belongs, the application of rule [8-a © 11-a] excludes that of rule [M © 8-a], effectively 
blocking the appearance of -m in forms such as hu?-m?na-si ‘we (excl) taught them’. 
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Table 7: Some conflated realization rules of Limbu verb inflection 


Realization rules 


Block Rule label 


Properties realized Operation 
4 [I) © 4-a] {agt 1 sg} {pat 3} X Xun 
[M © 4-a] {agt -3 pl} {pat 3} X — Xum 
[N © 8-a] {{agt 1 sg} {pat ns}: X — Xsin 
8 [N © 8-b] {{agt 1 sg} {pat-3 -incl pl} X —> Xin 
[M © 8-a] {{agt -3 pl} {pat ns}: X  Xsim 
11 [8-a © 11-a] {fagt1 pe} pret {pat 3 ns} X —> Xm?nasi 


This analysis correctly defines all of the forms in Table 1. In particular, it accounts 
for the superficially erratic distribution of the agent concords -y and -m. Thus, Table 8 
presents the manner in which the rules in Tables 6 and 7 define four words: 


e hu?r-u-n-si-n ‘I teach them’, in which -y appears twice—after -u and after -si; 
e hufr-u-r ‘I teach him’, in which -n appears after -u only; 

e hu?-ne-ci-y ‘I teach you two’, in which -p appears after -si only; and 

e hu?-ne ‘I teach you (sg.)’, in which -7 fails to appear. 


As Table 8 shows, -7 only appears in conflation with an immediately preceding carrier: 
in one case, it appears twice because there are two carriers to conflate with; in another, 
only the carrier -u is available; in yet another, only the carrier -si is available; and some- 
times, there is no carrier at all to conflate with. The proposed analysis provides a similar 
account of the comparable behavior of the suffix -m. 


6 Rule conflation and polyfunctional concord in Southern 
Sotho 


Return now to the morphology of verbal concord in Southern Sotho. As we saw in §3, 
this morphology is largely polyfunctional. Typically, a verbal concord may appear in ei- 
ther of two positions in a verb’s inflectional morphotactics; but unlike the agent-coding 
suffixes in Limbu, which express the same content no matter where they appear, the 
Southern Sotho verbal concords express subject properties in one position but object 
properties in another. The notion of rule conflation makes it possible to account for this 
difference by assuming that in Southern Sotho, the rules expressing noun-class concord 
conflate with a general rule of subject concord in one block and with a general rule of 
object concord in a different block. Because the two general rules are formulated as iden- 
tity functions (realizing subject concord and object concord, respectively), the conflated 
subject concords have the same phonological form as the conflated object concords. 
Thus, consider the following definition of the Southern Sotho inflectional markings in 
Tables 2 and 3. In this analysis, there are three blocks of independent realization rules, 
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Table 8: The definition of four Limbu verb forms in the proposed analysis 


Property set: {agt 1 s} {pat 3 ns} {{agt 1 s} {pat 3 s} 
Stem: hu? ‘teach’ hu? ‘teach’ 
(prevocalically hu?r) (prevocalically hu?r) 
Rule applying in 
Block: ` st: (none) (none) 
sf2: (none) (none) 
sf4: [D © 4-a]: hu?r-u-n [N © 4-a]:hu?r-u-n 
sf7: (none) (none) 
sf8: [D © 8-a]: hu?r-u-n-si-n (none) 
sf10: (none) (none) 
sf11: (none) (none) 
hurr-u-n-si-n hu?r-u-n 
‘I teach them’ ' teach him’ 


Property set: 


{{agt 1 s} {pat 2 dell {agt 1 s} {pat 2 s} 


Stem: hu? ‘teach’ hu? ‘teach’ 
Rule applying in 
Block: sfi: 1-a: hu?-ne 1-a: hu?-ne 

sf2: (none) (none) 
sf4: (none) (none) 
sf7: (none) (none) 
sf8: [D © 8-a]: hu?-ne-ci-n (none) 
sf10: (none) (none) 
sf11: (none) (none) 

hur-ne-ci-n hur-ne 

‘I teach you two’ ‘I teach you (sg.)’ 


as in Table 9. Block a houses the rules of object concord: these include the special ob- 
ject-concord rules for the first-person singular (a-i) and third singular class 1 (a-ii); in 
addition, it includes a default rule (a-iii) realizing object concord by means of an iden- 
tity operation. Block b houses rules realizing tense properties, here exemplified by the 
future tense. Block c houses rules of subject concord, including the special rule (c-i) of 
first-person singular subject concord and a default rule (c-ii) realizing subject concord 
by means of an identity operation. In addition to the independent realization rules in 
Table 9, the analysis requires the large inventory of dependent rules in Table 10. The 
conflation rule in (6) conflates each dependent rule with the default object-concord rule 
(a-iii) and with the default subject-concord rule (c-ii). The resulting conflated rules are 
listed (redundantly) in Table 11. 
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Table 9: Three blocks of independent realization rules in Southern Sotho 


Realization rules 


Block Rule Carrier 
label Properties realized Operation rule? 
a-i {{obj 1 sg} X — NX* 
a a-ii {{obj 3 sg cO X — moX 
a-iii {{obj}} XX yes 
b b-i {fut} X — tlax 
c-i ([sbj 1 sg} X — Lex 
R c-ii {{sbj}} XX yes 


*N represents a homorganic nasal. 


Table 10: Dependent realization rules for verbal concord in Southern Sotho 


Rule 


Realization rules 


label 
agr-i 
agr-ii 
agr-iii 
agr-iv 
agr-v 
agr-vi 
agr-vii 
agr-viii 
agr-ix 
agr-x 
agr-xi 
agr-xii 
agr-xiii 
agr-xiv 


Properties realized Operation 


{{2 sg} 

(5 sg} 

{{3 sg cL:3}} 
{{3 sg cL:4} 
{{3 sg ca 
{{3 sg cL:6}} 
{{3 cr:7] 

HU) pi} 

{{2 pl} 

{{3 pl cx:1} 
{{3 pl cx:1|2}} 
{{3 pl cx:3}} 
{{3 pl cx:4|5}} 


X — uX 
X — oX 
X o IEN 
X — seX 
X — eX 
X — box 
X — hox 
X  reX 
X o IEN 
X — baX 
X eX 
X — aX | lix 
X  liX 
X aX 


([3 pl c:6]) 


Conflation rule 


Where agr-n is a dependent realization rule and R is a carrier rule in Block a, 


[agr-n © R] € Block a. 
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Table 11: Some conflated realization rules of Southern Sotho verb inflection 


Block Rule label 


Realization rules 


Properties realized 


Operation 


a 


[agr-i © a-iii] 
[agr-ii © a-iii] 
[agr-iii © a-iii] 
[agr-iv © a-iii] 
[agr-v © a-iii] 
[agr-vi © a-iii] 
[agr-vii © a-iii] 
[agr-viii © a-iii] 
[agr-ix © a-iii] 
[agr-x © a-iii] 
[agr-xi © a-iii] 
[agr-xii © a-iii] 
[agr-xiii © a-iii] 
[agr-xiv © a-iii] 


ffobj 2 sg} 

{{obj 3 sg cx:1|2} 
(obj 3 sg c1:3} 
(obj 3 sg cr:4H 
{fobj 3 sg cL:5}} 
(obj 3 sg ce 
(obj 3 cL:7}} 
{obj 1 pl} 

{{obj 2 pl} 

{{obj 3 pl ci:1}} 
(obj 3 pl ci:1|2}} 
{{obj 3 pl cr:3]] 
{{obj 3 pl c1:4|5} 
obt 3 pl ci:6}} 


X — ux 
X — 0X 
X — IEN 
X — séX 
X — eX 
X — box 
X — hox 
X  reX 
X o IEN 
X — baX 
X — eX 
X — aX | lix 
X  liX 
X aX 


[agr-i © c-ii] 
[agr-ii © c-ii] 
[agr-iii © c-ii] 
[agr-iv © c-ii] 
[agr-v © c-ii] 
[agr-vi © c-ii] 
[agr-vii © c-ii] 
[agr-viii © c-ii] 
[agr-ix © c-ii] 
[agr-x © c-ii] 
[agr-xi © c-ii] 
[agr-xii © c-ii] 
[agr-xiii © c-ii] 
[agr-xiv © c-ii] 


{{sbj 2 sey} 

{{sbj 3 sg cL:1|2}} 
{{sbj 3 sg c1:3} 
{{sbj 3 sg cr:4H 
{{sbj 3 sg cL:5}} 
{{sbj 3 sg cL:6}} 
{{sbj 3 cx:7}} 
{sbj 1 pl) 

((sbj 2 pl} 

{{sbj 3 pl cx:1} 
{{sbj 3 pl cx:1|2} 
{{sbj 3 pl cx:3} 
{{sbj 3 pl cx:4|5} 
{{sbj 3 pl cL:6} 


X — ux 
X — 0X 
X o IEN 
X — seX 
X eX 
X — box 
X — hox 
X  reX 
X o IEN 
X — baX 
X — eX 
X — aX | lix 
X  liX 
X — ax 


Each of the conflated rules in Table 11 belongs to the same rule block as the carrier 
rule on which it is based. As in the Limbu analysis proposed above, a conflated rule and 
the carrier rule on which it is based compete to realize certain morphosyntactic property 
sets, and being the narrower rule, the conflated rule prevails in each such case. 

This analysis correctly defines all of the forms in Tables 2 and 3. In particular, it 
accounts for the fact that in all but a handful of cases, each subject concord has a cor- 
responding object concord that expresses the same person, number and noun class by 
means of the same prefix. Thus, Table 12 presents the manner in which the rules in Tables 
10 and 11 define two words: 


434 


19 Rules and blocks 


e ba-tla-bo-bona ‘they (cr:1) will see it (cL:6)’, in which ba- is a third-person plural 
class 1 subject concord and bo- is a singular class 6 object concord; and 


e bo-tla-ba-bóna ‘it (cr:6) will see them (cr:1), in which bo- is a singular class 6 


subject concord and ba- is a third-person plural class 1 object concord. 


Table 12: The definition of two Southern Sotho verb forms in the proposed 
analysis 


Property set: — ((sbj 3 pl cr:1) fut {obj 3 sg cx:6}} 
Stem: bona ‘see’ 
Rule applying in 
Block a: [agr-vi©a-iii]: — bo-bóna 
b: b-i: — tla-bo-bóna 
[agr-x © c-ii]: — ba-tla-bo-bóna 
ba-tla-bo-bona 
‘they (cL:1) will see it (cL:6)’ 
Property set: — ((sbj 3 sg cr:6) fut {obj 3 pl cx:1}} 
Stem: bona ‘see’ 
Rule applying in 
Block a:  [agr-x©a-iii]: ba-bona 
b: b-i:  tla-ba-bona 
c: [agr-viO c-ii]: bo-tla-ba-bóna 
bo-tla-ba-bona 
“it (cL:6) will see them (cr:1)' 


As Table 12 shows, the dependent rules introducing bo- (agr-vi in Table 10) and -ba 
(agr-x in Table 10) both conflate with the carrier rule a-iii (Table 9) to produce rules of 
object concord in Block a and both conflate with the carrier rule c-ii (Table 9) to produce 
a rule of subject concord in Block c. 


7 Wider evidence for rule conflation 


The analyses proposed here for multiple exponence in Limbu agent concord and for 
polyfunctional verbal concords in Southern Sotho both depend on the notion that mor- 
phological rules may conflate to produce more complex rules (- principle (3)) and the 
notion that conflated rules may compete with simple rules as members of the same rule 
block (= principle (4)). 

These principles of rule conflation are motivated independently of the need to account 
for multiple exponence and polyfunctionality. First, they make it possible to account for 
apparent anomalies in the interaction of inflectional rule applications. For example, a 
rule's order of application may seem to depend on whether or not another rule applies. 
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In Fula, a pronominal object suffix on a verb in the relative past tense ordinarily follows 
that verb’s subject suffx, as in (8a,8b); but in the particular case in which a verb has 
both a singular personal object suffix (2sg -mA or 3sg -mO) and the first-person singular 
subject suffix -mi, the expected order is reversed, as in (8c,d). Principles (3) and (4) allow 
one to say that the rules realizing the subject and object suffixes in the relative past tense 
belong to a single rule block; that the object rules ordinarily conflate with the subject 
rules; but that the -mi rule instead conflates with the -mA and -mO rules. 


(8) a. mball-u-mi-be-’ 
help-REL.PST.ACT-1SG.SBJ-3PL.CL.2.0BJ-FG 
‘T helped them’ 

b. mball-u-daa-mO-’ 
help-REL.PST.ACT-2SG.SBJ-3SG.CL.1.0BJ-FG 
‘you (sg.) helped him’ 

c. mball-u-mA-mi-’ 
help-REL.PST.ACT-2SG.OBJ-1SG.SBJ-FG 
‘I helped you (sg.)’ 

d. mball-u-mO-mi-’ 
help-REL.PST.ACT-3SG.CL.1.0BJ-1SG.SBJ-FG 
‘T helped him’ (Arnott 1970, Appendix 15) 


In another apparently anomalous interaction of inflectional rules, an affix either pre- 
cedes the stem with which it joins or follows it, with the choice of position being con- 
ditioned by the presence or absence of some other affix. In Swahili, the verbal concord 
coding the properties of a relative verb form’s relativized argument appears postver- 
bally in tenseless affirmative forms, but preverbally in forms that are prefixally marked 
for tense or negation; thus, the class 8 relative concord vyo is postverbal in (9a) but pre- 
verbal in (9b). The principles of rule conflation make it possible to say that the relative 
affix is suffixed to the verb stem by default, but is suffixed to an overt prefixal exponent 
of tense or negation (Stump To appear). 


(9) a. a-vi-soma-vyo 
SBJ:CL.1-OBJ:CL.8-read-REL:CL.8 
‘(books) which he reads’ 
b. a-si-vyo-vi-soma 
SBJ:CL.1-NEG-REL:CL.8-OBJ:CL.8-read] 
‘(books) which he doesn't read’ 


As Stump (2017a) shows, the principles of rule conflation afford simple solutions to a 
number of other apparent anomalies in the interaction of inflection rules. These include 
the incidence of variable affix order (Bickel et al. 2007) and of Wackernagel affixes (Nevis 
& Joseph 1992, Bonami & Samvelian 2008) as well as the superficially puzzling fact that 
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affix sequences may preserve the same internal order whether the sequence as a whole 
is prefixal or suffixal, as in European Portuguese verb inflection (Luis & Spencer 2005). 

Second, the principles of rule conflation in (3) and (4) make it possible to account 
for nonmonotonic interactions among inflectional rules. The usual expectation is that 
a realization rule possesses the same intrinsic properties whether it applies alone or in 
combination with other rules. But there are anomalous cases in which this expectation 
is not met. Once the definition of a language’s morphology includes a conflated rule 
IR, © R;], this rule may evolve independently, taking on properties not directly stem- 
ming from either R; or R2. In this way, the properties exhibited by a rule applying in 
isolation may not always be preserved when it is conflated with other rules. In view of 
this fact, the content attributed to conflated rules in §4.4 above should be seen as their 
default content, subject to modification by processes of grammaticalization. That is, the 
content expressed by rule [R; © R32] is, in the default case, a monotonic function of the 
content expressed by rules R; and R3; but this default is subject to override. 

There are at least three ways in which the resulting nonmonotonicity may be mani- 
fested. One reflection of this fact is the phenomenon of “potentiation” (Williams 1981), 
by which an unproductive rule becomes productive when applying in combination with 
another rule (as the unproductive -ity rule becomes productive in combination with the 
-able rule; cf. Aronoff 1976, Bochner 1992). 

Another reflection is the fact that the domain of rule R; seems to depend on whether a 
particular rule Rg applies subsequently. By principles (3) and (4), such cases arise when 
a conflated rule [R; © R2] evolves a domain application distinct from that of Rz. Thus, 
a stem may be in the domain of R? but not that of [R; © R2], as in the case of base > 
basic, * basical; at the same time, a stem may be in the domain of [R; © R3] but not that 
of Rg, as in the case of whimsy — whimsical, *whimsic. A third reflection arises in cases 
in which two rules apparently realize less content separately than they do together. In 
Latin regémus ‘we shall rule’, the conflation of the rules that suffix -é and -mus expresses 
the first-person plural future active even though neither rule by itself is an expression 
of future tense.’ These nonmonotonic phenomena have never before been seen as man- 
ifestations of a single overarching principle; the principles of rule conflation, however, 
facilitate precisely such a perspective. 

Third, the principles of rule conflation make it possible to account for parallelisms be- 
tween the application of a single rule and that of a sequence of rules. A word form's in- 
flectional morphology is sometimes informally conceived of as instantiating a sequence 
of "slots" each of which corresponds to a set of rules available to fill it. Andersonian rule 
blocks are a kind of formal reconstruction of this idea, whose simplest interpretation 
involves individual rules providing alternative ways of filling the same slot. There are, 
however, apparent deviations from this pattern, in which successive slots are ordinar- 
ily filled by successive rule applications but may in some instances be simultaneously 
filled by a single rule application introducing a “wide” affix that somehow straddles two 
or more slots. The Swahili portmanteau prefix si- is an example. In Swahili negative 
indicative verb forms, the usual pattern is for the negative ha- rule to fill slot 1 and a 


13 See Stump (20172) for discussion of a similar case from Old English. 
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subject-concord rule to fill slot 2, e.g. ha-tu-ta-taka [NEG-1PL-FUT-want] ‘we will not 
want’. But in first-person singular negative verb forms, the application of the negative 
first-person singular si- rule seems to straddle slots 1 and 2. The principles of rule confla- 
tion resolve this conundrum by allowing a rule block to contain both conflated rules (e.g. 
the first-person plural negative ha-tu- rule) and simple rules (e.g. the si- rule) in paradig- 
matic opposition; in this way, the behavior of portmanteau rules is reconciled with the 
natural assumption that paradigmatic opposition is a relation between two rules rather 
than a relation between a rule and a sequence of rules. 

The principles of rule conflation in (3) and (4) are a simple and natural extension of 
the principles of realization-rule interaction developed by Anderson (see again 1 and 2). 
Rule conflation allows a variety of apparently recalcitrant phenomena to be reconciled 
with a general scheme of rule interaction based on ordered blocks of realization rules in 
which the members of a given block are mutually exclusive in their application. 


Abbreviations 
The following abbreviations are employed for the morphosyntactic properties. 


AGT agent 

PAT patient 

PRET preterite 

1/2/3 first/second/third person 
-3  nonthird person 

EXCL exclusive 

-INCL noninclusive 


NS nonsingular 
SG singular 

PL plural 
References 


Anderson, Stephen R. 1977. On the formal description of inflection. In Woodford A. Beach, 
Samuel E. Fox & Shulamith Philosoph (eds.), Papers from the thirteenth regional meeting 
of the Chicago Linguistic Society, 15-44. Chicago: Chicago Linguistic Society. 

Anderson, Stephen R. 1982. Where’s morphology? Linguistic Inquiry 13(4). 571-612. 

Anderson, Stephen R. 1984a. On representations in morphology: Case marking, agree- 
ment and inversion in Georgian. Natural Language and Linguistic Theory 2. 157-218. 

Anderson, Stephen R. 1984b. Rules as ‘morphemes’ in a theory of inflection. In D. Rood 
(ed.), Mid-america linguistics conference papers, 3-21. Boulder: University of Colorado. 

Anderson, Stephen R. 1986. Disjunctive ordering in inflectional morphology. Natural 
Language and Linguistic Theory 4. 1-31. 

Arnott, D.W. 1970. The nominal and verbal systems of Fula. Oxford University Press. 

Aronoff, Mark. 1976. Word formation in generative grammar. Cambridge, MA: MIT Press. 


438 


19 Rules and blocks 


Bickel, Balthasar, Goma Banjade, Martin Gaenzle, Elena Lieven, Netra Prasad Paudyal, 
Ichchha Purna Rai, Manoj Rai, Novel Kishore Rai & Sabine Stoll. 2007. Free prefix 
ordering in Chintang. Language 83. 43-73. 

Bochner, Harry. 1992. Simplicity in generative morphology. Berlin: Mouton de Gruyter. 

Bonami, Olivier & Pollet Samvelian. 2008. Sorani Kurdish person markers and the typology 
of agreement. Paper presented at the 13th International Morphology Meeting. Vienna. 

Doke, C.M. & S.M. Mofokeng. 1985. In Textbook of Southern Sotho Grammar, 2nd edn. 
Cape Town: Maskew Miller Longman. 

Gazdar, Gerald, Ewan Klein, Geoffrey Pullum & Ivan Sag. 1985. Generalized phrase struc- 
ture grammar. Cambridge: Harvard University Press. 

Harris, Alice C. 2017. Multiple exponence. Oxford: Oxford University Press. 

Luis, Ana & Andrew Spencer. 2005. A paradigm function account of ‘mesoclisis’ in Eu- 
ropean Portuguese. In Geert Booij & Jaap van Marle (eds.), Yearbook of morphology, 
177-228. Dordrecht: Springer. 

Nevis, Joel A. & Brian D. Joseph. 1992. Wackernagel affixes: evidence from Balto Slavic. 
In Geert Booij & Jaap van Marle (eds.), Yearbook of morphology 3, 93-111. Dordrecht: 
Kluwer Academic Press. 

Stump, Gregory. 2001. Inflectional morphology: A theory of paradigm structure. Cam- 
bridge: Cambridge University Press. 

Stump, Gregory. 2017a. Rule conflation in an adequate theory of morphotactics. Vol. 64. 
Acta Linguistica Academica. 

Stump, Gregory. To appear. Polyfunctionality and the variety of inflectional exponence 
relations. In Ferenc Kiefer, James P. Blevins & Huba Bartos (eds.), Morphological para- 
digms and functions. Leiden: Brill. 

van Driem, George. 1987. A grammar of Limbu. Berlin: Mouton de Gruyter. 

Williams, Edwin. 1981. On the notions “lexically related” and “head of a word”. Linguistic 
Inquiry 12. 245-74. 

Zwicky, Arnold M. 1985. How to describe inflection. In Mary Niepokuj, Mary Van Clay, 
Vassiliki Nikiforidou & Deborah Feder (eds.), Proceedings of the eleventh annual meet- 
ing of the berkeley linguistics society, 372-386. Berkeley: Berkeley Linguistics Society. 


439 


Part IV 


Language and Linguistic Theory 


Chapter 20 


Darwinism tested by the science of 
language 
Mark Aronoff 


Stony Brook University 


Linguistics enjoyed great success in the last half of the 19'^ century. The use of tree diagrams 
to express the genetic relations between languages spread from linguistics to evolutionary 
biology. The achievements of the Neogrammarians in establishing sound laws, however, led 
to a realization that the exceptionless laws of language change bore no resemblance in kind 
to the laws of natural science. Language evolution had no principled basis akin to natural se- 
lection. Saussure solved this problem in the Cours by rooting linguistic theory in synchronic 
states of language rather than historical change, thus relegating diachronic linguistics to a 
minor position in the field. In recent decades, the field of cultural evolution has allowed for 
the application of well-established principles from evolutionary biology and ecology. The 
application of one of these, Gause’s principle of competitive exclusion, to central problems 
of morphology has produced good results, suggesting prospects for the revival of evolution- 
ary explanation in language along the lines of what linguists envisioned a century and a 


half ago. 


1 Introduction 


Steve Anderson refuses to forget the past. Along with Peter Matthews, throughout his 
career he has reminded the community of theoretical linguists, especially morphologists, 
of the continuity of our culture. His two most recent publications, one of them a tribute 
to Matthews (Anderson 2017; to appear b), deal with the history of morphology. The 
following brief essay is a small tribute to Steve’s life-long effort to demonstrate that we 
can understand how we think and act today only to the extent that we also understand 
where our thinking comes from. In it, I trace the rise and fall of the relation between 
linguistics and evolutionary biology in the half century between the publications of the 
foundational work of modern biology, Darwin’s On the Origin of Species by Means of 
Natural Selection, or the Preservation of Favoured Races in the Struggle for Life, in 1859, and 
the foundational work of modern linguistics, Saussure’s Cours de linguistique générale, 
in 1916. 


Mark Aronoff. 2017. Darwinism tested by the science of language. In Claire Bowern, 
| Laurence Horn & Raffaella Zanuttini (eds.), On looking into words (and beyond), 443- 
456. Berlin: Language Science Press. Z lo.4 
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In 1859, historical (also termed evolutionary at the time) linguistics was on a meteoric 
trajectory as one of the most successful of academic disciplines, having provided precise 
demonstrations that numerous modern languages were related in specific ways. In the 
next thirty years, it would produce even more remarkable results and Darwin himself 
would invoke its successes. 

Saussure, author in 1878 of one of the most spectacular works in historical linguistics, 
did a complete volte face in his posthumous book forty years later, advocating that the 
field concentrate on description of single language states rather than historical relations. 
He rejected evolutionary accounts of language for one sweeping reason: the lack of 
an explanatory framework. Saussure’s argument quickly turned the field away from 
diachrony to synchrony. 

A century later, though, it has become clear that an evolutionary account of what 
Darwin himself was referring to when he wrote that “The survival or preservation of 
certain favoured words in the struggle for existence is natural selection” (Darwin 1871: 
61) can provide just such a framework, allowing historical explanation of language to 
return to its rich relation with evolution and evolutionary theory. 


2 Schleicher, Haeckel, and stem trees 


They were friends, both professors at the University of Jena, who shared a love of botany 
and gardening. August Schleicher, the linguist, born in 1821, was 13 years older than 
Ernst Haeckel, the embryologist, and had been on the faculty since 1857. Haeckel had 
taken up his post as professor of comparative anatomy in 1862, soon after receiving his 
doctorate in zoology. Haeckel had been captivated by Darwin’s landmark 1859 work and 
had recommended it to Schleicher. Schleicher opened his 1863 pamphlet, Die Darwinsche 
Theorie und die Sprachwissenschaft — offenes Sendschreiben an Herrn Dr. Ernst Haeckel 
[Darwinian theory and language science — an open letter to Dr. Ernst Haeckel] with an 
acknowledgement so notable that I repeat the passage here in its entirety.! 


You would leave me no peace until I began reading Bronn’s [1860] translation of 
the much discussed work of Darwin On the Origin of Species by Means of Natural 
Selection, or the Preservation of Favoured Races in the Struggle for Life. I have com- 
plied with your request; I have waded through the whole of the book, in spite of 
its being rather clumsily arranged, and heavily written in a curious kind of Ger- 
man, and the greater part of the work I was tempted to read again and again. My 
first thanks are now offered to you for those repeated inducements of yours which 
ended in my study of this incontestably remarkable work. (Schleicher 1863/1869: 
13-14) 


Schleicher had very recently achieved academic renown as the author in 1861/2 of the 
two-volume Compendium der vergleichenden Grammatik der indogermanischen Sprachen: 
Kurzer Abriss e. Laut-u. Formlehre d Indogerm. Ursprache, d Altindischen, Alteranischen, 


! All passages from Schleicher (1863) are quoted from Alex V. W. Bikkers's 1869 English translation. 
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Altgriechischen, Altitalischen, Altkeltischen, Altslavischen, Litauischen u. Altdeutschen 
[Compendium of the Comparative Grammar of the Indo-European languages. A brief out- 
line of the Indo-European parent language, Old Indian, Old Iranian, Ancient Greek, Old 
Italian, Old Celtic, Old Slavic, Lithuanian and Old German]. He would die five years later 
in 1868 at the age of 47. Haeckel would outlive his friend by over a half century and 
become the greatest Continental disseminator of Darwinian thought through his best- 
selling book Natiirliche Schépfungsgeschichte, published in the year of Schleicher’s death 
and translated into English as The History of Creation. Haeckel, as an embryologist, is 
perhaps most famous for the slogan “ontogeny recapitulates phylogeny” and the associ- 
ated erroneous theory. Together, though, as described by Burrow (1972), O’Hara (1996), 
and Gontier (2011), the two friends are responsible for the use of tree diagrams in depict- 
ing evolutionary relations among languages first (Schleicher 1861/1862; 1863) and then 
species (Haeckel 1866). Haeckel (1874) presented Stammbdaume lit. ‘stem trees’, of both 
the historical evolution of Indo-European languages and the ‘pedigree of man’ together 
in the same book. What Haeckel had learned from Schleicher during the few years that 
they were colleagues in a small university was that the evolution of languages and the 
evolution of species were sufficiently analogous to warrant the use of the same diagram- 
matic method to describe the two. The method survives to this day in both fields, but 
little else remains in common between the two. How did they move so far apart? 


3 Darwinism tested by the science of language 


Schleicher went further than analogy. The main point of his 1863 pamphlet, only 48 
pages long in the original German edition, shorter than many academic journal articles 
today, was that the results of historical linguistics over the previous half century consti- 
tuted a successful “test” of Darwin’s theory of evolution. First, Schleicher asserted that 
languages were what he called “organisms of nature? He needed this to be true in order 
to directly test Darwin’s theory, which deals with natural organisms, by applying it to 
languages. 


Languages are organisms of nature; they have never been directed by the will of 
man; they rose, and developed themselves according to definite laws; they grew old, 
and died out. They, too, are subject to that series of phenomena which we embrace 
under the name of “life” The science of language is consequently a natural science; 
its method is generally altogether the same as that of any other natural science. 
(Schleicher 1863: 20-21) 


The rules now, which Darwin lays down with regard to the species of animals and 
plants, are equally applicable to the organisms of languages, that is to say, as far as 
the main features are concerned. (Schleicher 1863: 30) 


Here is where Schleicher understood that linguistics had a contribution to make. As is 
well known, Darwin acknowledged in his introduction that he had no direct evidence for 
the application of his theory to "the variability of species in a state of nature" (Darwin 


445 


Mark Aronoff 


1859: 4). The closest he could come was “a careful study of domesticated animals and 
of cultivated plants” (ibid.), which is why he devoted the first chapter of his book to 
“Variation under Domestication.” Only in the last quarter century have we been able to 
observe evolution at work, most notably in Richard Lenski's Long Term Experimental 
Evolution Project (e.g, Tenaillon et al. 2016). Schleicher offered language evolution as the 
only tangible proof of evolution available at the time: 


Nobody doubts or denies any longer that the whole Indogermanic family of speech 
- Indic, Iranic, (old Armenian, Persic, & c.,) Hellenic, Italic, (Latin, Oscan, Umbrian, 
with the daughters of the former) Keltic, Slavonic, Lithuanian, Teutonic or German, 
that all these languages, consisting of numerous species, races and varieties, have 
taken their origin from one single primitive form of the Indo-Germanic family. 
(Schleicher 1863: 34) 


We are actually able to trace directly in many idioms that they have branched off 
into several languages, dialects & c., for we are in a position to follow the course of 
some, nay, of whole families of them during a period of more than two thousand 
years, since a faithful picture of them has been left us in writing. This, for instance, 
is the case with Latin. (Schleicher 1863: 41-42) 


4 Max Müller 


Darwin learned of Schleicher's work in an indirect way but in time to mention it in his 
second book, a dozen years after the publication of his first (Darwin 1871). The 1863 pam- 
phlet received a very positive brief anonymous review in a short-lived British weekly, 
the Reader, in 1864 but appears to have attracted little attention in England, where Ger- 
man was not commonly read.? The English translation, though, published in 1869, caused 
a stir. Max Müller himself, the best-known linguist and popularizer of language study 
in the country, reviewed it at some length in the first volume of Nature, a successor to 
the Reader as a general science periodical for the public (Müller 1870). Darwin quoted 
from this review in his most famous passage on language. In the review, Müller acknowl- 
edged the power of Schleicher's analogy: “He thinks rightly that the genesis of species, 
as explained by Mr. Darwin, receives a striking illustration in the genealogical system of 
languages" (Müller 1870: 257); “No reader of Mr. Darwin's books can fail to see that an 
analogous process pervades the growth of a new species of language, and of new species 
of animal and vegetable life" (Müller 1870: 258). 

Müller disagreed with Schleicher on a number of points. The least noticed but most 
insightful was his objection to Schleicher's claim that linguistics is a natural science. It is 
natural only in that "languages are not produced by the free-will of individuals . . . [T]he 
freedom of the individual is necessarily limited by the pressure exercised by all upon all. 
Speech in its very nature is mutual" (Müller 1870: 258), a point that presaged Saussure's 


? Alter (1999) speculates that the writer of the Reader piece was Frederic William Farrar, who was later 
responsible, as Dean of Westminster Abbey, for Darwin's interment there. 
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observation on the social nature of langue. The second point was Müller" e hobby-horse, 
his idiosyncratic idea that “In tracing the origin of species, whether among plants or 


animals, we do not begin with one perfect type of which all succeeding forms are simple 
modifications. . . It is the same with languages” (Müller 1870: 258). Müller here betrays 
his complete misunderstanding of Darwinism as well as of mainstream linguistics of his 


time, 


whose proponents - most prominently William Dwight Whitney - he sparred with 


throughout his career (Whitney 1875; Alter 1999). It is no wonder that Darwin had little 
use for Moller, with whom he also disagreed on a more fundamental issue: the continuity 


of humans with other creatures. This passage of Müller's is important enough to be cited 


in its 


entirety: 


A much more striking analogy, therefore, than the struggle for life among separate 
languages, is the struggle for life among words and grammatical forms which is 
constantly going on in each language. Here the better, the shorter, the easier forms 
are constantly gaining the upper hand, and they really owe their success to their 
own inherent virtue. Here, if anywhere, we can learn that what is called the process 
of natural selection, is at the same time, from a higher point of view, a process of 
rational elimination; for what seems at first sight mere accident at the dropping of 
old and the rising of new words, can be shown in most cases to be due to intelligible 
and generally valid reasons. Sometimes these reasons are purely phonetic, and 
those words and forms are seen to prevail which give the least trouble to the organs 
of pronunciation. At other times the causes are more remote. We see how certain 
forms of grammar which require little reflection, acquire for that very reason a 
decided numerical preponderance; become, in fact, what are called regular forms, 
while the other forms, generally the more primitive and more legitimate, dwindle 
away to a small minority, and are treated at last as exceptional and irregular. In 
the so-called dialectic growth of languages we see the struggle for life in full play, 
and though we cannot in every instance explain the causes of victory and defeat, 
we still perceive, as a general rule, that those forms and those words carry the day 
which for the time being seem best to answer their purpose. (Müller 1870: 258) 


Darwin evidently approved of the argument, for he was generous enough to cite 


Müller's last point in The descent of man, and selection in relation to sex the following 


year: 


3 


We see variability in every tongue, and new words are continually cropping up; but 
as there is a limit to the powers of the memory, single words, like whole languages, 
gradually become extinct. As Max Müller has well remarked: — “A struggle for life is 
constantly going on amongst the words and grammatical forms in each language. 
The better, the shorter, the easier forms are constantly gaining the upper hand, 
and they owe their success to their own inherent virtue.” To these more important 


? Dingemanse (2013) is a fascinating blog-post about the evolution of the citation of this passage in the last 
decade. Most have attributed the observation to Darwin and neglect to note that Darwin had himself 
directly credited and cited Müller. 
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causes of the survival of certain words, mere novelty may, I think, be added; for 
there is in the mind of man a strong love for slight changes in all things. The 
survival or preservation of certain favoured words in the struggle for existence is 
natural selection. (Darwin 1871: 60-61) 


5 Darwinism and the laws of language 


With Miller's approving review of Schleicher’s pamphlet, and more importantly with 
Darwin’s endorsement, linguistics had entered the mainstream of scientific discourse, a 
long-time goal of its practitioners that persists to this day. But not for long. In the end, 
the most important legacy of Darwinian thinking for 19'* century linguistics was the 
clarification of the scope of the term law as it applies to language. As late as Müller's 
review, linguists could still see their ultimate goal as the formulation of general laws of 
language origin and structure on a par with the laws of physics and chemistry, perhaps 
based on Darwin's theory of evolution. What linguists discovered instead were the one- 
off contingent laws of sound change, from Grimm's law to Verner's law, startling but of 
no general significance beyond their purported exceptionlessness. Linguists could still 
call them laws, though, not on account of generality but because of their regularity. This 
was, however, small consolation. As Osthoff and Brugmann so memorably declared in 
their Neogrammarian credo: 


First, every sound change, inasmuch as it occurs mechanically, takes place accord- 
ing to laws that admit no exception. That is, the direction of the sound shift is 
always the same for all the members of a linguistic community except where a 
split into dialects occurs; and all words in which the sound subjected to the change 
appears in the same relationship are affected by the change without exception. (Os- 
thoff & Brugmann 1878/1967: 204) 


Hermann Paul stressed a couple of years later that sound laws in no way resemble 
those of physics and chemistry, but were statements of regular but contingent historical 
facts. To those who had aspired to gain for the science of language a place among the 
natural sciences, laws of this character would have been disappointing: 


Can we assert uniformity of sound-laws? In the first place, we must fully under- 
stand what we mean, generally speaking, by a sound-law. The word ‘law’ is itself 
used in very different senses, and this fact induces errors in its application. The 
idea of sound-law is not to be understood in the sense in which we speak of ‘laws’ 
in Physics or Chemistry, nor in the sense of which we were thinking when we con- 
trasted exact sciences with historical sciences. Sound-law does not pretend to state 
what must always under certain general conditions regularly recur, but merely ex- 
presses the reign of uniformity within a group of definite historical phenomena. 
(Paul 1880/1889: 57) 
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6 Ferdinand de Saussure and the end of evolution 


The greatest individual achievement of Neogrammarian historical linguistics was Saus- 
sure’s Mémoire sur le systéme primitif des voyelles dans les langues indo-européennes. Writ- 
ten in a fury and printed in fascicles one by one in 1878 when Saussure was a 21-year-old 
student, it was the only sizable work of his lifetime. He published brief articles on scat- 
tered topics afterwards, apparently unable to find a unifying vision. 

Saussure returned to his native Geneva in 1891 as Extraordinary Professor of Indo- 
European languages. He was not named Ordinary Professor of General Linguistics until 
December of 1906, an additional responsibility added to Sanskrit, comparative philology, 
and the occasional language course. Between 1906 and 1911, he gave three biennial series 
of lectures on general linguistics as required by his new position, all covering similar 
ground, to his devoted but few students and colleagues (Joseph 2012: 617). We will never 
know whether he intended to publish this work. Saussure died in February 1913. His 
students gathered their notes together and organized them, publishing the results in 1916 
in tribute to their late master as the Cours de linguistique générale, whose reputation 
gradually grew, until it became justly regarded as the founding document of modern 
theoretical linguistics. The Course in general linguistics remains influential today across 
a broad range of disciplines. 

The Course in general linguistics comprises for the most part an attempt to establish a 
new foundation for the science of language. The first chapter of the Cours, “A glance at 
the history of linguistics,” is five pages long in Baskin’s 1959 translation. Of traditional 
grammar, Saussure writes that “It lacked a scientific approach” (Saussure 1959: 1). He 
credits Bopp with “realiz[ing] that the comparison of related languages could become 
the subject matter of an independent science” (Saussure 1959: 2), noting that he could 
not have succeeded without Jones’s ‘discovery’ of Sanskrit, which is “exceptionally well- 
fitted to the role of illuminating the comparison [with Greek and Latin]” (Saussure 1959: 
2). Importantly for both Sausssure and us, “the comparative school . . . did not succeed 
in setting up the true science of linguistics. It failed to seek out the nature of its study” 
(Saussure 1959: 3). “Not until around 1870 did scholars begin to seek out the principles 
that govern the life of languages” (Saussure 1959: 4). He credits Whitney (1875) and the 
Neogrammarians with realizing that language is “a product of the collective mind of lin- 
guistic groups” (Saussure 1959: 5) and not “an organism that develops independently” 
(Saussure 1959: 5), as Schleicher had claimed in rhetorical support of his Darwinian ar- 
gument. Saussure concludes the chapter by stating that “the fundamental problems of 
general linguistics still await solution,’ a solution that he proceeds to outline in the rest 
of the book. 

One of the most important components of Saussure’s solution was the observation 
that the science of language could be divided into the analysis of single states of a lan- 
guage, états de langue — synchronic linguistics - and the analysis of a succession of such 
states — diachronic linguistics. Tellingly, the chapter devoted to this fundamental dis- 
tinction was entitled "Static and evolutionary linguistics,’ (Saussure 1959: 79), a title that 
echoes the earlier connection between linguistics and evolutionary biology. He used the 
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term diachronic interchangeably with evolutionary but diachronic eventually won out, 
presumably because it lacked any suggestion of a connection to biological evolution; 
perhaps also because of the bad reputation that the term evolution had gained when ap- 
plied to the study of the human language faculty since being banned by the Société de 
Linguistique de Paris in 1866. Saussure made clear that synchrony was more important 
in the last two paragraphs of the chapter: 


Synchronic linguistics will be concerned with the logical and psychological relations 
that bind together coexisting terms and form a system in the collective mind of 
speakers. 


Diachronic linguistics, on the contrary, will study relations that bind together suc- 
cessive terms not perceived by the collective mind but substituted for each other 
without forming a system. 


It is fair to say that Saussure’s distinction was the most important factor leading to a 
shift in the focus of linguistics in the century since. In the last chapter of the book, “Con- 
cerning retrospective linguistics,’ Saussure reduces diachrony to synchrony and thus 
dispenses with the former in a single argument. Diachrony, for Saussure, is simply “an 
infinite number of photographs, taken at different times” (Saussure 1959: 212). 

The photographic analogy is striking. The Lumiére brothers had perfected the ciné- 
matographe in 1895, only a decade before Saussure’s first lectures on general linguistics 
and it had quickly grown in popularity in Saussure’s French-speaking world. Cinematog- 
raphy allowed for the depiction of passage through time as a sequence of successive pho- 
tographs or states: a moving picture is a succession of photographs shot and projected 
at regular very short intervals. The individual photographs matter much more than the 
interval, which is always the same. No single transition is of interest. The depiction of 
the passage of time is simply an illusion created by the sequence of static photographs. 
We cannot know if the cinema had any influence on Saussure’s thought, but he was 
very clear in asserting that “From the speakers’ point of view, diachrony does not exist; 
speakers deal only with a state.” (Joseph 2012: 594). 

Saussure had struggled through his life with the problem that the field to which he 
had devoted his career, evolutionary historical linguistics, had not been able find any 
principled theoretical basis. Once he sat down to provide a theory of language, the re- 
sult was a theory of what we now call linguistic structure or grammar, not of historical 
linguistic evolution. He could come to terms with this conclusion only by killing the field 
that had borne him. His reduction of diachrony to synchrony and his double insistence 
that a linguistic system must be synchronic and that diachronic linguistics is not sys- 
tematic in this sense was the most important factor leading to the radical shift that the 
field underwent in the next few decades. By 1945, the synchronic system and Saussure 
had won. Modern linguistics was synchronic linguistics and attempts to tie linguistics 
to evolution in any way had been abandoned.’ 


4 As late as 1929, though, Edward Sapir could still proudly proclaim, in an article entitled “The status of 
linguistics as a science,’ that “Many of the formulations of comparative Indo-European linguistics have 
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7 Principles of cultural evolution 


Historical or evolutionary linguistics had been one of the most successful academic enter- 
prises of the nineteenth century, amassing concrete results such as the establishment of 
historical language families and the reconstruction of a number of proto-languages. Saus- 
sure’s conjecture on the vowel system of Indo-European, for example, was confirmed 
by the decipherment of Hittite in the early 20'^ century and the observation made by 
Kurylowicz (1935) that a number of consonants in the recently deciphered ancient Hit- 
tite language, not found in other Indo-European languages, lined up nicely in their dis- 
tribution with the coefficients sonantiques that Saussure had proposed based solely on 
historical analysis. The problem that Saussure confronted in his theoretical work was 
that, unlike Darwinian evolutionary biology, which was grounded in the great insight 
of natural selection, the field had no explanatory basis. His solution was to dismiss the 
field. 

All biologists agree that, in the memorable words of Theodosius Dobzhansky (2013), 
“nothing in biology makes sense except in the light of evolution" Darwin's theory of 
natural selection provides a satisfying sweeping explanation for the origin and evolution 
of all biological species, while the modern synthesis provides the genetic mechanism that 
underpins reproductive success. As the Neogrammarians noted themselves (see above), 
there are no equivalent principles in historical linguistics. All the ‘laws’, exceptionless 
though they may be, are contingent facts. 

Can there be general principles of linguistic change?" Saussure certainly did not pro- 
pose any, but there are reasons for optimism. William Labov, whom many consider to 
be the most important linguist of our time, published a massive work in three volumes 
(Labov 1994; 2001; 2010) entitled just that: Principles of Linguistic Change. Blevins (2004) 
has written an influential book entitled Evolutionary phonology: The emergence of sound 
patterns. Others, notably Boer (2001), Galantucci (2005), Simon Kirby (Verhoef, Kirby 
& Boer 2014), and Kenny Smith (Kirby et al. 2015) have looked at emergent systems of 
language based on evolutionary models. None of these have direct ties to Darwinian the- 
ory. Some, though, are firmly within the tradition of what has come to be called cultural 
evolution. 

The founder of cultural anthropology, Edward B. Tylor, had embraced evolution from 
the start (Tylor 1871) but he entwined evolution with material progress, providing fodder 
for a long detour into social Darwinism, an idea also traceable in part to Ernst Haeckel, 
and to related eugenics movements around the globe. Franz Boas, the most prominent 
anthropologist of his time, took up the flag against all things evolutionary in culture 
(Boas 1928) and drove evolution from the field for close to a century in both anthropology 
and the closely related sociology. The idea that culture could and should be studied 
from an evolutionary point of view raised its head occasionally (e.g., Sahlins 1960), with 
Marvin Harris especially exploiting the notion of an ecological niche as an explanatory 


a neatness and a regularity which recall the formulae, or the so-called laws, of natural science? (Sapir 
1929: 160). One could write a book about this confusion of formulae with laws. Saussure understood the 
difference. 

5 Kurylowicz (1935) set out a set of six laws of analogy, but these are far from general principles. 
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device in cultural adaptation (e.g., Harris 1979), but the most important development in 
anthropology was the anti-empirical direction of the field under the influence of Geertz 
(1973), which was profoundly anti-evolutionary and even anti-explanatory. This line of 
thought soon led to a great weakening of the scope of the field of cultural anthropology. 

The disintegration of cultural anthropology left an opening for a more biologically ori- 
ented study of human behavior, often called behavioral ecology. A major thrust of this 
biological approach was the direct application of insights from evolutionary biology to 
cultural evolution. One of the most influential lines of work in this direction was led by a 
duo made up of a biologist (Peter Richerson) and an anthropologist (Robert Boyd), who, 
over a thirty-year collaboration, have written three influential books (Boyd & Richerson 
1985; 2005; Richerson & Robert 2005) and many articles in which they directly and pre- 
cisely apply principles from evolutionary theory to human culture, with many examples, 
works which have unfortunately had little influence in linguistics. Their framework pro- 
vides a simple answer to Saussure’s concern about the lack of principled explanation in 
evolutionary linguistics: cultural evolution can be explained using precise methods and 
we can start by exploiting principles taken quite directly from evolutionary biology. Ap- 
plying these methods to language requires that we first step back from the position that 
has dominated our field for the last half century and accept Sapir’s position: language 
is a product of the interaction of biology and culture and we cannot understand it by 
confining ourselves to one or the other. Once we adopt this position, we can look at how 
languages evolve culturally, on the basis of well-established principles. 

Taking direct inspiration from Richerson and Boyd, over the last half decade I have 
shown that a simple well-known principle from ecology, Gause's law of competitive 
exclusion (Lotka 1925; Volterra 1926; 1931; Gause 1934), provides very satisfying explana- 
tions for a variety of important long-standing problems in morphology and lexicology, 
including the absence of lexical synonyms, morphological productivity, allomorphy, the 
existence of inflectional classes, and the relation between morphology and writing (Lind- 
say & Aronoff 2013; Aronoff & Lindsay 2015; 2016; Berg & Aronoff 2017). 


8 Conclusion 


The moment I discovered Gause's principle, I was seized by an Andersonian impulse that 
Icould not shake until I had satisfied myself that I understood what had had happened in 
the relationship between linguistics and evolutionary biology since Darwin and Schle- 
icher. Why, as Morris Halle pronounced many years ago, did Saussure never publish 
the Course in general linguistics if it was so important? Why does mainstream academia 
pay lip service at best to Saussure's most accomplished work, the Mémoire sur le systéme 
primitif des voyelles dans les langues indo-européennes? Why did this greatest linguist 
of his time publish so little? Why did historical linguistics, the most successful human 
science of the 19'^ century, fall into the tenuous position that it holds today? And finally, 


é This research has nothing to say about the evolution of the language faculty, only about the evolution of 
individual languages. 
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how should we approach the relation between evolutionary theory and linguistics to- 
day? In this piece, I have begun to answer these questions for myself, in the profound 
belief, which I share with Steve Anderson and Peter Matthews, that we cannot get last- 
ing answers unless we understand the basis of the questions that drive our work and 
thought. Steve himself has recently written cogently about taking evolutionary biology 
seriously in any discussion of language. Notably, in Anderson (2013), he has reminded 
us that that the general properties of languages are not necessarily attributable solely 
to the language faculty. Many may have an external basis, and some may have been 
incorporated into the language faculty by natural selection itself. 
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Speech and birdsong are complex motor behaviors in which patterning over time is itself 
informational. This is obvious in the case of speech, but in birdsong, too, the sequencing (and 
possibly timing) of syllables determines in part the well-formedness of the song. Despite 
gross differences in function, in the physical substrate (method of sound production), in 
brain structure, and in the scale of the animals, recent work has revealed a surprising degree 
of similarity in their solutions to the problem of controlling temporal patterning. There are 
differences, too, of course, and when we find them, it deepens our understanding about the 
(unique) structure of speech. Because Steve has had a lasting interest in birds and birdsong 
(Anderson 2006), this seemed to be an appropriate context to review these similarities. Two 
of them will be the focus of discussion here: decomposition of the behavior into a sequence 
of discrete motor units and the role of an internal clock system, partly independent of the 
units themselves. 


1 Discrete decomposition 


One of the foundational motivations for Articulatory Phonology (Browman & Goldstein 
1992; 1995) is to address the apparent incompatibility between the discrete phonetic and 
phonological structure of speech, on the one hand, and the observation that the vocal 
tract articulators move in a continuous fashion, producing continuous modulation of 
the acoustics, on the other. AP hypothesizes that it is possible to model the continu- 
ous motion of the articulators as arising from discrete, context-independent dynamical 
control systems, called gestures, that govern the formation of phonologically-relevant 
constrictions within the vocal tract (for example bringing the tongue tip to the palate, 
with a particular degree of constriction). The control parameters of these dynamical sys- 
tems (target, e.g., the phonologically-specified degree of constriction, and stiffness, the 
time constant that determines the amount of time required for the system to settle at 
its target value) remain fixed during the duration of the constriction action (roughly a 
consonant or a vowel), even though the articulators are moving. The decomposition of 
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speech into a pattern of gesture activations over time (or gestural score), is an abstract 
analysis (as Steve argued in his earliest work (Anderson 1974) must be the case for a 
phonetic representation), and can only be discovered by use of explicit dynamical and 
acoustical models. We cannot observe it directly. Recently, Nam et al (2012) showed that 
with the use of the TaDA gestural production model, it is possible to parse an acoustic 
signal into the maximum likelihood gestural score that could have produced it. 

A similar strategy is employed by Amador et al. (2013), to decompose zebra finch song 
into discrete gestures. The authors first developed a dynamical model of sound produc- 
tion in the syrinx, the sound production organ in songbirds. The syrinx is a vibratory 
system located at the base of the trachea. The trachea divides into two tubes at that 
point, each of which hosts a pair of vibrating membranes called labia. The two pairs of 
labia can vibrate together or separately, or can be sequenced (Riede & Goller 2010). The 
model developed in Amador et al. (2013) (for a single pair of labia) allowed them to gen- 
erate sound from two dynamical control parameters: the average tension in the labia, 
and tracheal pressure. Then, using a table-lookup scheme, they were able to estimate 
the time functions of these two parameters from audio recordings of the sound. This 
representation was then validated by generating audio from those time functions and 
playing those sounds to zebra finches. Neural responses from the synthetic song were 
highly similar (nearly identical) to the responses obtained by playing the original song 
(BOS, Bird’s Own Song). Next, they showed that the derived time functions are essen- 
tially discrete: they exhibit sequences of intervals of time during which the values of the 
control parameters remain essentially fixed (just as with speech gestures), and refer to 
these intervals as elemental gestures of the song. The key similarity to speech gestures 
is that the continuous song can be decomposed into discrete intervals of time, longer 
by an order of magnitude than the periodicity of the song, during which the dynamical 
parameters are essentially fixed. 

There are also some salient differences between speech gestures and zebra finch song 
gestures. Most superficial is that the control parameters for the zebra finch gestures are 
different from those of speech gestures that control the constrictions of the suprlaryngeal 
structures (as expected, because forming constrictions is not generally thought to be part 
of the bird’s song behavior), but they are similar to those control parameters involved 
in controlling tone and intonation in speech (McGowan & Saltzman 1995). A somewhat 
deeper difference is that the zebra finch gestures (as analyzed in Amador et al. 2013) are 
strictly sequential, while speech gestures exhibit various types of overlap in time. At 
first blush, this makes sense, as the mechanism of sound production in birds is generally 
thought to be limited to a single device, the syrinx, while the distinct vocal organs of the 
human vocal tract can each make their own contribution to the filtering action of sound 
generated at the larynx. There are, however, as noted above, two sets of labia comprising 
the syrinx (Trevisan et al. 2007), and some avian species employ primarily one set, while 
others (like zebra finches) use both sets, simultaneously or sequentially. Trevisian et al. 
(2007) have shown how such symmetry-breaking (different functioning of the two sides 
at the same time) arises in the species that employ either one side only or both sides, 
but not symmetrically. So there is the possibility that distinct patterns of gesture overlap 
may yet be uncovered. 
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Another obvious potential difference between speech gestures and birdsong is com- 
positionality. A small set of discrete speech gestures are employed in different combina- 
tions to create the set of segments and syllables in a language. It is unknown whether 
gestures in birdsong are compositional in this sense. Zebra finch songs (on which a large 
bulk of the research on song birds has been done) have been described as having a hierar- 
chical structure. Syllables are the most immediately identifiable units, as they consist of 
the vocalization intervals produced by the bird that are bounded by (silent) inspirations. 
The total number of distinct syllables in a given bird’s inventory is relatively small, on 
the order of 20 or so. As described in Yu & Margoliash (1996), syllables can be composed 
of distinct notes, and in turn, sequences of syllables form motifs, that can be repeated as 
part of a song. It is unclear to what extent the elemental gestures identified in Amador 
et al. (2013) can form parts of more than one syllable in an individual’s inventory. Yu 
and Margoliash (1996) do report instances of distinct syllables in a bird’s inventory that 
begin with the same note (and end with different notes). To the extent that those shared 
notes are produced with the same gestures, this would be evidence for some limited 
compositionality. 


2 Clocks 


2.1 Clocks in speech? 


A lively debate in the 1980’s sparked by the work of Carol Fowler (Fowler 1980) con- 
sidered whether speech units had their own intrinsic timing as dynamical systems (as 
argued by Fowler), or whether the timing is imposed externally by some kind of cen- 
tral clock. Gestures in Articulatory Phonology are units with intrinsic timing; the time 
required for a gesture to reach its goal state is determined by its dynamical stiffness pa- 
rameter. But what of the time between gestures? For a sequence of two gestures, x; and 
X5, we can ask how the system controls when to trigger x? with respect to xı. A natural 
answer to this is that x2 is triggered when some reference state value of x, is achieved 
{x1,1}. Sequencing in motor systems is often modeled by a mechanism of “competitive 
selection" of the sequenced items (Bullock & Rhodes 2002; Grossberg 1978), where feed- 
back from the completion of element x; (achievement of its target state) allows it to be 
deactivated and element x; to be selected and triggered. In the case of speech, this feed- 
back could be kinesthetic, orosensory and/or acoustic. However, there is an argument 
that this cannot be the complete story for speech. Consider the gestural score for the 
word "spot" in the left panel of Figure 1. The boxes represent intervals in which the the 
supralaryngeal gestures would be active, in some token of this word. The onset of the 
lip closure for /p/ initiates at a moment when the tongue tip fricative gesture (for /s/) 
is at a particular state (close to its target and not moving much), as marked by the ver- 
tical line in the figure. A possible mechanism for sequencing the these gestures would 
be to learn that in producing the word "spot; the lip closure gesture “waits” until the 
system has feedback that the tongue tip is near the alveolar ridge and is moving with 
little velocity, and at that point it is triggered. But now consider the phrase "toss spot" 
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shown on the right side of Figure 1. Here, the state that the lip closure is looking for, in 
order to trigger, occurs too early, because the tongue tip is already in position for /s/. 
And there is a interval of time (shaded in white) during which the state of the tongue tip 
does not change much, so there is no information in its state that can inform when the 
/p/ should be triggered. Nonetheless, its timing must show some regularity, as the [s] in 
"toss spot" systematically differs in duration from, for example “pa spot? Figure 2 shows 
the kinematics of the tongue tip in a sequence of identical supralaryngeal gestures (“had 
tied”). It is clear that there is indeed a considerable stretch (70 ms or so) marked with a 
yellow box where its state does not change, and so information about when to trigger a 
next gesture is lacking. 

This case appears to argue that a simple state-based chain triggering will not work for 
speech, in the general case, and relative timing must be specified in some way separately 
from the actual gestural content of the units. It would also be possible to argue, in this 
case, that the lip gesture is triggered when the tongue tip gesture begins to release, but 
this just pushes the problem back onto the tongue tip release gesture. How does it know 
know when to trigger, without access to information about time? It can’t just use the 
position and velocity of the tongue tip. Tilsen (2016) has recently proposed a theory that 
gestural sequencing based on feedback from the preceding gestures does indeed charac- 
terize the system at early stages of the child’s development, gradually shifting to a differ- 
ent, coordination-based scheme as described below (for at least some syllable-contexts). 
It could also be countered that the cases presented here involve timing across words, 
and perhaps word-sequencing is controlled by a separate mechanism from within-word 
gesture sequencing. However, the same issue would arise within words in the case of 
geminate consonants (consonants that are maintained for a long temporal interval). And 
of course, the existence of geminates at all is itself prima facie evidence for some inde- 
pendence of timing and gestural content, as the same constrictions can be maintained 
for different durations, under linguistic control. 


"spot" "toss spot" 


TT 
TB 


LIPS 


time time 


Figure 1: Gestural scores. On the left for the word “spot” and on the right for 
the phrase "toss spot" Rows represent (from the) top, gestures of the Tongue 
Tip, Tongue Body and Lips. See text. Shaded area represents interval of time 
during which state of tongue tip is not changing. 
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“had tied” 


= [dt] [d] 
venne n RB o 


100 ms 


Figure 2: Time functions of the vertical position of the Tongue Tip (TT) and 
Tongue Dorsum (TD) in the phase “had tied.” See text. 


The coupled oscillator model of syllable structure (Goldstein, Byrd & Saltzman 2006) 
proposes a specific alternative to gesture sequencing, in which the clock machinery is 
separate from the particular gestures that form the syllable. In this model, the gestures 
composing a syllable are triggered by a system of planning oscillators (clocks) that are 
coupled to one another in distinct modes. Each planning oscillator triggers activation of 
a gesture. Specifically, clocks that trigger gestures comprising onset consonants (conso- 
nants preceding the vowel in a syllable) are coupled in-phase (the most stable mode) to 
the vowel gesture and clocks that trigger gestures comprising coda consonants (conso- 
nants following the vowel in a syllable) are coupled in anti-phase mode to the vowel. If 
every gesture is triggered at phase 0 degrees of its planning oscillator, then two gestures 
that are coupled in phase will be triggered synchronously. This synchronous triggering 
explains data that show that the onset of articulatory movement for an onset consonant 
and for the following vowel begin at roughly the same time (Goldstein, Byrd & Saltz- 
man 2006). When two gestures are coupled in anti-phase mode, however, they will be 
triggered a half-period apart in time, which would be consistent with the observed time 
lag between the onset of the vowel gesture and the onset of a coda consonant gesture 
(Goldstein, Byrd & Saltzman 2006). The ensemble of oscillators can be formally repre- 
sented as a (coupling) graph, and Figure 3 shows the coupling graph for the word "tab? 
Green edges represent in-phase coupling and the dashed red edge represents anti-phase 
coupling. Note that the same graph topology would underlie the timing of gestures in 
any CVC syllable, and in that sense, the clock is separate from, and independent of, the 
particular gestures that are deployed. The model has been used to explain patterns of syl- 
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lable typology, acquisition (Nam, Goldstein & Saltzman 2009), asymmetric coordination 
patterns in onset vs coda (Marin & Pouplier 2010), and weight, and it has been used as a 
diagnostic for the syllable structure of complex pre-vocalic clusters (Hermes, Muecke & 
Grice 2013; Shaw et al. 2009). 


clo lab 


— 


Figure 3: Coupling graph for the word "tab" Each clock represents one of the 
gestures in the word and they are the nodes of the coupling graph. From the 
top down the left, these are glottal abduction for the initial /t/, pharyngeal con- 
striction (for the vowel), and tongue tip closure for the /t/. At the right is the lip 
closure for /b/. Green lines are graph edges that represent in-phase coupling 
and the red dashed line with arrowhead is the edge that represents anti-phase 
coupling. Boxes represent the gestural score for the the word (gesture activa- 
tion over time) that results from running the coupled oscillator model. 
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The frequencies of the planning oscillator clocks are all defined with reference to the 
ticks of an overall speech rate clock. Prosodically-induced lengthening can thus be mod- 
eled as slowing of the rate of this overall clock, as has been proposed in the z-gesture 
model (Byrd & Saltzman 2003). Phrase edges are associated with local z-gestures, which 
function to slow the movements of all gestures that fall under the scope of the z-gesture. 
This model has been shown to account for the acoustic and kinematic correlates of boun- 
dary lengthening in a variety of languages (e.g. Greek; Katsika et al. 2014). 


2.2 Clocks in birdsong? 


Work on birdsong over the last 15 years has also revealed, within limits, separate control 
of timing and vocal organ activation patterns. Two areas of the avian cortex have been 
identified as significant for the production of the song: HVC, a pre-motor nucleus, and 
RA (robust nucleus of the arcopallium). HVC projects to RA, which in turn projects to 
the vocal motor neurons (and to midbrain vocal control areas). HVC was suspected to 
be a major site of timing control, and this was tested in a seminal study by Long and Fee 
(2008). Reasoning that cooling a brain region would result in a slowing of neural pat- 
terns, they used a miniature Peltier device to locally cool either HVC or RA. They found 
that cooling HVC resulted in slowing of the song, with the amount of slowing being 
proportional to the degree of cooling. Further, the slowing was fairly linear throughout 
the song. Syllable durations, onset lags, gap durations between motifs were all slowed 
to roughly the same degree, indicating that something like an overall clock (like the 
proposed speech clock) was being slowed. Consistent with the independence of timing 
account, there was very little change at all in the actual acoustics of the song, indicating 
that the control of the activation patterns at the level of the motor neurons remained in- 
tact, just spread out in time. (In other words, the rate at which the motor commands were 
issued was slowed down, but the commands were not changed, so the frequencies of the 
song were not altered by slowing). Conversely, even though spiking was decreased by 
cooling RA, the ability of (uncooled) HVC to drive RA and produce typical song speeds 
was not impaired, thus providing evidence for localizing timing control in HVC. 

A more recent study of cooling from a different lab using canaries (Goldin et al. 2013) 
found that with more extreme cooling of HVC, the song begins to break down, exhibiting 
period-doubling of respiratory patterns, causing the emergence of additional syllables. 
The authors provide a formal model that predicts these transitions from the nonlinear 
interaction between the (hypothesized) neural pulse train (from HVC) and the dynamics 
of the respiratory cycle. Interestingly, this kind of period-doubling can also be observed 
in “gestural intrusions” human speakers produce when repeating phrases like “top cop”, 
and a similar dynamical account has been proposed, less formally (Goldstein et al. 2007). 
That study found that when speakers produce such phrases repeatedly, they will begin 
to produce an "extra" copy of the tongue tip gesture of /t/ concurrently with the initial 
tongue dorsum gesture of "cop" (resulting in a co-produced /Kt/) and conversely an extra 
tongue dorsum gesture during the initial tongue tip gesture of “top”. These extra cycles of 
repeated tongue tip or tongue dorsum movement can be analyzed as a period doubling 
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- 2:1 to 1:1 transitions in frequency mode locking between the tongue tip (or tongue 
dorsum) oscillators and the lip oscillator of the syllable-final lip gesture (there is a lip 
gesture in every syllable, but a tongue tip or dorsum gesture only every other syllable; 
Goldstein et al. 2007). Since such period-doubling transitions in birdsong are analyzed by 
Goldin et al. (2013) as resulting from a presumed slowing of a clocking pulse in HVC, the 
results do not contradict the main finding and conclusion of the earlier work of Long and 
Fee. However, there is disagreement between the two research groups as to the nature 
of the temporal code in HVC and how it interacts (or not) with the rest of the system, as 
will be fleshed out a bit in the last section. 


2.3 Brain-cooling in speech 


The technique of focal brain cooling was recently employed with humans for the first 
time (Long et al. 2016) with patients undergoing brain surgery for either epilepsy or 
tumor resection. Cooling was applied in up to 4 locations in each subject, two in Broca’s 
area within the left inferior frontal gyrus (IFG) and the others in the precentral gyrus 
(speech motor cortex). The hypothesis was that there would be a double dissociation 
with cooling in Broca’s area causing changes in speech timing but not in articulatory 
quality, and that cooling the speech motor cortex would disrupt articulation, but not 
timing. Patients were recorded producing the digits from 1 to 20 or the names of the 
days of the week (one sequence per trial, with breaks between trials) while respective 
sites were being cooled, and also during control trials with no cooling. The utterances 
were judged for quality via crowd-sourcing on a scale from 0 (extremely degraded) to 
1 (typical/normal). Timing was determined through durational measurements. Results 
supported the double dissociation. Cooling Broca’s area resulted in changes to speech 
timing. Typically utterances were slowed down (both the actual articulation of words 
and the gaps between them were stretched), but some cases speeding up occurred. No 
effect was found on judged quality. When the speech motor cortex was cooled, ratings 
shifted to more degraded, with no effect on timing. 

To examine the slowing more carefully, the authors generously made available the 
data from two of their subjects, one of which is analyzed here. Figure 4(a-b) shows 
boxplots for durations of the names of the days of the week (excluding pauses between 
names); on the left, control utterances are displayed and on the right, the trials with 
cooling of Broca’s area. Results show fairly uniform slowing across the names of the 
days of week, except for "Friday; which shows less slowing. Somewhat surprising is that 
Friday is the shortest of all the words (even in the controls); there is certainly no tendency 
for list-final prosodic lengthening here. In terms of intonation, M-W generally appear 
to be produced with an extended High tone. Falling begins on “Thursday” and “Friday” 
is generally produced on a Low tone. So it is possible that the durations follow the 
prominence profile of the utterance. By itself, however, this does not explain the reduced 
percentage of slowing on “Friday.” Another possibility is that cooling in Broca’s area has 
a bigger effect on more complex syllable types (for example with coda consonants or 
clusters). The initial syllables of the days of the week all have closed syllables (with coda 
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consonants) except “Friday”. To test this, the durations of the initial syllables and final 
syllable (“day”) were analyzed separately, and the magnitude of lengthening of initial 
(dark blue bars) vs. final (light yellow bars) syllables are shown in Figure 5. Magnitude 
of lengthening is calculated as the ratio of the median duration of that syllable when 
cooled divided by the median duration of controls. The lengthening of the (open) syllable 
“day” is approximately the same across all the days’ names. The lengthening of the first 
syllable in “Friday” (an open syllable) is about the same as for “day”, while the other first 
syllables (that are closed) lengthen more. The most lengthening is observed on the first 
syllable of “Wednesday”, which is also the most complex syllable, closed with a coda 
cluster. This is consistent with the hypothesis that more complex syllables exhibit more 
slowing due to cooling in Broca’s area. 
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Figure 4: (a-b) Boxplots of duration of the names of days of the week from one 
patient. Control condition is shown in (a), cooling Broca’s area is shown in (b). 
(c-d) Boxplots of silent gap durations before the production of the names of the 
week Tuesday-Friday. Control condition is shown in (c), cooling Broca’s area 
is shown in (d). 


Figure 4(c-d) shows the durations of the silent period before initiation of the words 


“Tuesday” to “Friday” from the time of completion of the preceding word. This shows a 
strikingly different pattern from that exhibited by the word durations. The silent gaps 
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before “Tuesday” and “Friday” show very little effect, while the gaps before “Wednesday” 
and “Thursday” show almost 3:1 lengthening. The pattern is reminiscent of the classic 
pattern of recall in short-term memory (Deese & Kaufman 1957; Ebbinghaus 1885; Brown, 
Neath & Chater 2007). The items in the middle of the list have more competitors on either 
side and therefore more interference, although many other models have been proposed. 
Such results could be modeled by a competitive queuing model of sequence selection 
(Bullock & Rhodes 2002), depending on exactly how the parameters are set. In any case, 
it is clear that more is going on than just clock slowing when Broca’s area is cooled, 
unlike what is observed in zebra finch, though clock slowing is also going on. 
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Figure 5: Magnitude of lengthening of initial (dark blue bars) vs. final (light 
yellow bars) syllables of the names of the days of the week. Magnitude of 
lengthening is calculated as the ratio of the median duration of that syllable 
when cooled divided by the median duration of that syllable in the uncooled 
control condition. 


In summary, the results with humans generally confirm the dissociation of the tim- 
ing clock from the articulatory gestural patterning that it paces. Differently from birds, 
however, where gaps between syllables and motifs were slowed in a roughly similar 
way to the actual song syllables, the gaps between the days of the week showed marked 
differences in response to cooling, depending on position in the list (for the one patient 
examined). This suggests that even for an over-learned list, mechanisms of selection of 
discrete individual words must still be in play, while for the bird, the entire song may 
just “run off” at different rates. This may be related to the relative lack of flexibility in 
the zebra finch song. Also the possible effect of syllable complexity on the magnitude 
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of slowing also suggests that more is going on for humans when Broca’s area is cooled 
than just uniform slowing of the clock. 


2.4 Continuous vs discrete temporal representations 


The last point brings up the nature of temporal coding that characterizes the representa- 
tion in HVC of the bird (compared for example to the clock model proposed for speech 
discussed above). The work of Fee and collaborators has consistently supported the view 
that the representation is a continuous-time representation of the song (in 10 ms or so 
slices). This is based on the earlier discovery (Hahnloser, Kozhevnikov & Fee 2002) of in- 
dividual cells that burst sparsely in the song at a fixed lag from song onset. Theoretically 
then, there could be such cells for each 10ms sample in the sound, and they jointly pro- 
duce a continuous representation. Further, their hypothesis is that the continuous-time 
representation completely drives (or enslaves) the downstream activity in RA and the 
vocal muscles to reproduce the song (Long & Fee 2008), which is why the slowing does 
not result in distortions to the song (but cf. the results discussed earlier with extreme 
values of cooling). An alternative discrete view was proposed by the Margoliash and 
Mindlin group (Amador et al. 2013). After discovering that it was possible to decompose 
the dynamical parameters governing song production into discrete gestures, as discussed 
above, they found that burst times of HVC neurons projecting to RA tended to be syn- 
chronized with the gestural extrema, for 14 of the 15 sites they examined with recordings 
of single neurons. This is exactly what would be predicted by coupled oscillator model of 
syllable structure described above: the clock mechanism generates a sequence of bursts 
that trigger their corresponding gestures. However, attempts to replicate this finding 
with a substantially larger population of cells, in both Long’s lab (Picardo et al. 2016) and 
Fee’s (Lynch et al. 2016), failed to replicate this finding. It is unclear why this is, apart 
from possible differences in sites examined and the types of electrodes used. It would 
not be surprising to find that both continuous and discrete representations co-exist in 
different subpopulations of neurons. The discrete representation would be useful during 
learning to produce individual “pieces” of the song on the way to mastery (assuming a 
continuous representation of the target song in auditory areas is any case available to 
the system). Consistent with this, Lynch et al. (2016) did find evidence of 10-Hz rhyth- 
micity locked to song syllables, which was significant for HVC projections to Area-X 
(basal ganglia loop employed in learning) but not for HVC projections to RA. Given the 
stereotypy of zebra finch song, it is not surprising that a continuous-time representation 
could work. Obviously in the case of speech, we are capable of producing novel forms, 
and for that a discrete representation like the coupled oscillator model is really the only 
viable candidate (or compatibly, models like that of Bohland and Guenther, e.g., Bohland, 
Bullock & Guenther 2010). 
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3 Summary 


Speech and birdsong share the property that their production can be decomposed into a 
sequence of discrete motor actions. In addition, the control of those actions is governed 
by a separate timing representation. The nature of the timing representation appears 
to be substantially different however, possibly due to the essential combinatoriality and 
productivity of human speech, though there is a lot still unknown about both speech and 
birdsong in this regard. It is interesting to consider why they should be as similar they 
are. One functional similarity is that while they are both species-specific capabilities, 
in both cases the specific behavioral forms must be learned by individuals (in the bird 
species in which the song is learned from experience). There are other odd similarities 
as well, such as the compatible frequency of their syllable rates. This flies in the face of 
hypotheses that the duration of the syllables in speech is related to the natural frequency 
of the jaw (e.g., Davis & MacNeilage 2004). A more likely cause may be the similarity of 
their auditory systems. In any case, the existence of a model system that can be probed 
in ways that speech cannot provides the opportunity of deepening our understanding of 
speech, particularly when we observe the particular places in which the systems diverge. 
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Chapter 22 


“Constructions” and grammar: Evidence 
from idioms 
Julia Horvath 


Tel Aviv University 


Tal Siloni 


Tel Aviv University 


The paper presents results of our investigation of the distribution of idioms across diatheses 
(voice alternations) in English and Hebrew. We propose an account and discuss its con- 
sequences for idiom storage and its implications for alternative architectures of grammar. 
We provide evidence that idioms split into two distinct subtypes, which we label “phrasal” 
versus “clausal” idioms. Based on idiom surveys, we observe that phrasal idioms can be 
specific to the transitive, the unaccusative or the adjectival passive diathesis, but cannot 
be specific to the verbal passive. Clausal idioms, in contrast, do not discriminate between 
diatheses: they tend to be specific to a single diathesis. These findings, we argue, cannot be 
accommodated by a Construction Grammar approach, such as Goldberg (2006), which as- 
sumes knowledge of language consists merely of an inventory of stored ‘constructions’, and 
does not distinguish between a storage module versus a computational system, attributing 
all significant grammatical generalizations to inheritance networks relating stored entities, 
and general cognitive and functional constraints. An adequate theory of idioms must have 
recourse to a distinction between stored items and unstored derivational outputs, and to 
grammatical distinctions. We outline an account of the findings, distinguishing between 
diatheses according to where they are formed, and assigning different storage to idioms 
according to whether their head is lexical or functional. 


1 Introduction 


Theories of linguistic knowledge all assume a storage component, where the associa- 
tions of form and meaning are stored. There is a controversy as to the nature of this 
component, call it the lexicon: How much does it list? What does it allow? What else 
is there beyond the lexicon? In contrast with Generative Grammar, which assumes a 
modular, multi-component model (Chomsky 1965 and subsequent work), Usage-based 
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Construction Grammar (CxG) (e.g. Goldberg 2006) and similar work assume that hu- 
man knowledge of language is nothing more than a network of stored constructions. 
There is no faculty of language and no language specific mechanisms, no derivations, 
just a lexicon of constructions, labelled 'Construct-i-con', which includes morphemes, 
words, idioms, partially lexically filled as well as fully abstract phrasal patterns. Gener- 
alizations across languages are explained by general cognitive constraints together with 
the functions of the particular constructions. Language-specific generalizations across 
constructions arise via inheritance networks. 

The rationale behind the assumption of a construct-i-con is as follows: (i) Idioms, for 
the most part, involve an internal makeup consisting of phrasal units. Since their mean- 
ing is unpredictable and associated with the whole construction, they are most plausibly 
stored as constructions. (ii) The distinction between idioms and ‘other constructions’ 
(involving argument realization) is hard to detect in many instances, because often the 
specific meaning of a sentence not involving an idiom (in the traditional sense) seems 
better specified as a property of the construction, not as properties of the verb and of 
its complements (e.g., the ‘transfer of possession’ meaning of ‘He sliced Chris a piece of 
cake’ vs. the ‘caused motion’ interpretation of ‘He sliced carrots into the salad’, although 
both sentences feature sliced). Hence, constructions in general should be stored as such. 

Indeed, idioms exhibit an inherent duality. On the one hand, they are phrasal units 
with internal syntactic structure, and on the other, they are associated with an unpre- 
dictable, conventionalized meaning. Therefore, the question as to how they are stored 
is particularly intriguing. Given that they are grammatical constructs and interact with 
grammar (can be embedded, can allow passivization, etc.), they must be stored intra- 
grammatically, in the lexicon. This paper investigates the storage of idioms, aiming to 
shed light on the nature of the lexicon. Further, idioms are the archetypal construction 
to be stored a la CxG; therefore, they constitute a test case (for alternative conceptions 
of grammar and the lexicon) most favorable to CxG. So if our investigation of idioms 
finds that the storage they require is inconsistent with CxG’s central tenet that grammar 
is comprised of nothing but networks of stored ‘constructions’, this must be all the more 
so for more productive, prima facie compositional kinds of ‘constructions’. 

Investigating the distribution of idioms across diatheses (transitive, unaccusative, ad- 
jectival passive, and verbal passive), we observe contrasts between the cross-diatheses 
distribution of distinct types of idioms. One type of idiom (which we will label ‘phrasal’) 
distributes differently in the verbal passive diathesis versus the transitive, unaccusative 
and adjectival passive diatheses: it cannot be specific to the verbal passive, but can be 
specific to the latter diatheses. Another type of idiom (‘clausal’), in contrast, does not 
discriminate between diatheses in this way: Idioms of this type tend to be specific to a 
single diathesis. We then show that a construct-i-con type of theory cannot account for 
these findings. To account for these systematic distinctions, which idioms (the archety- 
pal ‘construction’ a la CxG) exhibit, the theory requires more than cognitive principles, 
reference to functional needs, and inheritance of properties between stored entities (‘con- 
structions’). 


1 This approach is also referred to as Cognitive Construction Grammar (CCxG); see Boas (2013) for an 
overview of this versus other varieties of construction grammar models. 
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The structure of the paper is as follows. Sections 2 and 3 draw a distinction between 
lexically headed idioms, which we label ‘phrasal’ idioms, and idioms headed by a senten- 
tial functional head, which we label ‘clausal’ idioms, and discuss each type (respectively), 
paying particular attention to their distinct distribution across diatheses. §4 offers ad- 
ditional evidence for the partition into phrasal and clausal idioms, and lays out the im- 
plications regarding a CxG-type model. §5 sketches an account for the findings in the 
framework of a derivational and modular architecture of grammar. 


2 Phrasal idioms 


It has sporadically been observed in the literature that the verbal (eventive) passive (e.g. 
sold in "Ihe first costumer was sold the car’) and the adjectival (stative) passive (e.g., 
shaven) differ regarding the distribution of idioms. While there do not seem to be idioms 
specific to the verbal (eventive) passive (i.e., idioms in the verbal passive that have no 
transitive (active) alternant), there are idioms specific to the adjectival (stative) passive 
(see Ruwet 1991 for English and French, and Dubinsky & Simango 1996 for Chichewa). A 
first quantitative survey of idiom dictionaries examining these observations is reported 
in Horvath and Siloni's 2009 study of Hebrew idioms: Out of 60 predicates sampled for 
4 diatheses — verbal passive, adjectival passive, transitive, and unaccusative- only the 
verbal passive exhibited no unique idioms. An idiom is considered ‘unique’ to a given 
diathesis o, if « does not share the idiom with its (existing) root-counterpart p, which o 
would most directly be related to by derivation. Specifically, verbal passives, adjectival 
passives, and unaccusatives are unique if there is no corresponding transitive idiom. 
Transitives are unique if there is no corresponding unaccusative idiom. Except for the 
verbal passive, all other three diatheses can head unique idioms.? This will be illustrated 
shortly with English examples in (4-6). 

Two observations are in order. First, the idioms mentioned in the above studies are 
all phrasal idioms (VP and AP) involving no sentential functional categories such as 
auxiliaries, negation, etc. Second, verbal passives in Hebrew are known to be rarer in 
spoken language in comparison to say English (Berman 2008), which may affect the 
inventory of verbal passive idioms in the language. 

In light of the above, we ran a parallel survey of English idiom dictionaries. We be- 
lieve such surveys are necessary for the study of idiom distribution, as speakers may 
sometimes have a hard time distinguishing whether a certain idiom variant exists and is 
commonly used or only could exist, i.e., is a priori possible, but is not documented. This 
is so because the spontaneous formation and learning of novel idiomatic expressions is 
part of speakers' linguistic competence. Also, knowledge of idioms varies considerably 
among speakers (similar to vocabulary knowledge). 


? The survey proceeded as follows. 60 predicates of each diathesis were sampled from a verb dictionary. 
The number of predicates out of the sample of 60 giving rise to unique phrasal idioms were counted. This 
was done by searches of idiom dictionaries, followed by Google searches to check occurrences of relevant 
root-mate idioms, and consultation of native speakers regarding the results. The number of unique idioms 
found: 0 verbal passive ones; 21 unaccusatives; 23 transitives; and 13 adjectival passives. 
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We have systematically distinguished between phrasal and clausal idioms, as defined 
in (1) and illustrated in (2). 


(1) Phrasal vs. clausal idioms 
a. Phrasal Idioms are headed by a lexical head (e.g., 2a). 


b. Clausal Idioms are headed by a sentential functional head (a fixed tense or 
mood, a modal, obligatory (or impossible) sentential negation or 
CP-material); they are not necessarily full clauses (e.g., 2b) 


Fixed sentential material is specified in parentheses. Non-idiomatic material within id- 
ioms is marked by italics. 


(2) a. land on one's feet 
‘make a quick recovery’ 
b. can't see the forest for the trees (modal, negation) 
‘doesn’t perceive the whole situation clearly due to focusing on the details’ 


Given that 'idiom' is a pre-theoretic term referring to various types of fixed expres- 
sions, we defined a core set. The set consisted of conventionalized multilexemic expres- 
sions whose meaning is figurative (metaphoric) and unpredictable by semantic composi- 
tion. A property often mistakenly conflated with the unpredictability of idioms' meaning 
is the level of opacity or transparency of their meaning. Idioms indeed differ from one 
another in the level of their transparency (opacity). For example, the phrasal and clausal 
idioms in 3a and 3b respectively may be felt more opaque than those in (2a-b). However, 
the degree of opacity can be determined only once we know the meaning of the idioms; 
neither the former nor the latter meanings can be predicted based on the meaning of 
their building blocks. Hence, the meanings of the idioms in (2) just like those of the 
idioms in (3) are unpredictable (even if a posteriori, more transparent). Such idioms are 
therefore part of the core set we have defined and included in our study. 


(3 a cool one's heels 
‘wait’ 
b. can’t hold a candle to someone/something (modal, negation) 
‘be not as good as someone/something else’ 


We first concentrated only on phrasal idioms. This enabled us to examine a coherent set 
of idiomatic expressions. 

The English survey we ran produced similar results to those of the Hebrew one. The 
transitive, unaccusative, and adjectival passive exhibited unique idioms, just like their 
Hebrew counterparts? Examples of unique unaccusative (4), adjectival passive (5), and 
transitive (6) idioms are given below. Notice that the nonexistent idiomatic version is no 
less plausible than the existing idiom. (# means the relevant sequence of words has no 
idiomatic meaning.) 


3 The English survey was conducted following the guidelines in Horvath & Siloni (2009) (see note 2). The 
number of predicates out of the sample of 60 giving rise to unique phrasal idioms in English: 15 unac- 
cusatives; 18 transitives; 10 adjectival passives. 
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(4) a. burst at the seams (unaccusative) 
‘filled (almost) beyond capacity’ 
b. #burst something at the seams (transitive) 
(5 a. caught in the middle (adjectival passive) 
‘trapped between two opposing sides’ 
b. #catch someone in the middle (transitive) 
(6) a. turn something on its ear (transitive) 


‘change something in a surprising and exciting way’ 


b. Sturm on its ear (unaccusative) 


However, unlike in Hebrew, the verbal passive in English turned out, prima facie, to 
present unique verbal passive idioms for 2 out of the 60 predicates, namely for caught 
and bitten. 'These idioms are given in (7). 


(7 a. caught in the crossfire 
‘hurt by opposing groups in a disagreement’ 


b. bitten by the x bug (where x forms a compound with bug) 
‘having the need/desire/obsession for x’ 


These phrasal idioms can be suspected at first to constitute unique verbal passive id- 
ioms, due to their listing in idiom dictionaries in the passive form, and not in the active, 
in contrast to the norm of listing verb phrase idioms in dictionaries in the active form. 
Moreover, according to native speakers, these forms can be modified by adverbials of 
duration or appear in the progressive, suggesting that they have eventive, verbal occur- 
rences. 

However, on closer examination, both of these turned out not to constitute true coun- 
terexamples to the generalization that there are no idioms unique to the verbal passive. 
Starting with 7a, the idiom caught in the crossfire, which indeed appears in the verbal 
passive, in fact is attested - based on Google searches accompanied by native speakers’ 
judgments - also in the transitive (active) form, as in (8), for instance; hence it is not a 
unique verbal passive idiom. 


(8) a. This caught him in the crossfire between radical proponents of 
independence and French opponents of anti-colonialism. 
(Scheck, 2014:282)4 


b. ...the Israeli-Palestinian conflict, which has often caught them in the 


crossfire. https://goo.gl/f2FbbG 


The idiom in 7b is instantiated by versions such as bitten by the travel bug, bitten by 
the acting bug, etc. These, just like 7a, can be true verbal passive forms; however, again, 


4 Scheck, Raffael. 2014. French Colonial Soldiers in German Captivity during World War II. Cambridge: Cam- 
bridge University Press. Available at https://goo.gl/QAGf9E. All online examples accessed 9 December 
2016. 
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Google searches turn up a significant number of active transitive examples of the same 
idiom, e.g., (9-10). 


(9) Before the acting bug bit me I had dreamed of being another Glenn 


Cunningham. (Halbrook 2001, 66)° 
(10) It was during my time in the Army in the 1960s and 1970s that the travel bug bit 
me. (MacKrell 2006, Introduction) 


The listing of (7a-b) in the passive participial form may well be due to the fact that 
in addition to occurring as a verbal (eventive) passive, they are also attested in the ad- 
jectival (stative) passive; the latter point is demonstrated by the idioms' occurrence as 
complements of verbs selecting APs but not VPs, such as seem and remain (Wasow 1977), 
as illustrated by (11-12). 


(11) a. Everyone else seems caught in the crossfire between these two, I honestly 


feel bad about everyone involved. https://goo.gl/trJp5o 
b. The Starbucks coffee chain remains caught in the crossfire of a dispute over 
"open carry" laws... https://goo.gl/PMCiMF 
(12) a. ..and Kevin remains bitten by the travel - and mapping - bug. 
https://goo.gl/xC8kWp 
b. It made an impression on Bowley, and he too seems bitten by the renovation 
bug... https://goo.gl/HO4LWn 


More generally, in the case of English in particular, it is important to keep in mind that 
there is the interfering factor of the common identity of form between verbal passives 
and adjectival passives, and only diagnostics can establish whether or not the particular 
idiom is indeed a verbal passive, and not (only) an adjectival passive one (see Wasow 
1977 for diagnostics). 

We can thus conclude that the idioms in (7) are not exceptions to the generalization 
that there is no unique idiom in the verbal passive." The next question is what can explain 
this. 

A priori, two alternative types of explanations for the above generalization come to 
mind: a derivation-based account in the spirit of derivational approaches of Generative 
Grammar or alternatively, an inheritance-based account, along the guidelines proposed 
by CxG. Abstracting away from details, a derivation-based account would have the ver- 
bal passive formed beyond the domain of special meanings, which would prevent verbal 
passives from having their own special/idiomatic meaning. An inheritance-based ac- 
count would have the verbal passive inherit the inability to give rise to idioms that it 


5 Halbrook, Hal. 2011. Harold - The Boy Who Became Mark Twain. New York: Farrar, Straus and Giroux. 
Available at https://goo.gl/ivWkAQ. 

6 MacKrell, Thomas. 2006. One Orbit - Around the World in 63 Days. Victoria, Oxford: Trafford. Available 
at https://goo.gl/bRIHKA. 

7 Additional idioms (headed by predicates not included in our sample) that may be suspected to be unique 
verbal passive idioms are discussed by Horvath & Siloni (2016), and are shown to also conform to the 
generalization. 
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does not share with its transitive alternant from the inability of verbal passives to lack 
a transitive alternant. 

A first indication that an inheritance-based account is not on the right track comes 
from inspection of the transitive-unaccusative alternation. This alternation manifests 
regularity at the verb level, but pervasive uniqueness at the idiom level. Intransitive 
unaccusative verbs have a transitive alternant (with a Cause external role) and vice versa 
(13), except for isolated instances (Hartl 2003, Reinhart 2002, among others). 


(13 a. Dan / The storm / The stone broke the window. 
b. The window broke. 


In other words, there are sporadic, isolated gaps in the transitive-unaccusative verbal 
alternation but the paradigm is rather regular. Nonetheless, there is pervasive unique- 
ness, namely, unpredictable gaps are common, at the idiom level. If so, then, the dis- 
tribution of phrasal idioms across diatheses is not determined by or inherited from the 
degree of productivity of their respective predicates. 

In sum, an inheritance-based account does not seem to be able to account for the 
observation that the verbal passive, unlike the transitive, unaccusative and adjectival 
passive cannot head unique phrasal idioms. Additional evidence against an inheritance- 
based account comes from clausal idioms, which are discussed in the next section. 


3 Clausal idioms 


As defined in 1b, clausal idioms are not necessarily full clauses; they are headed by a 
sentential functional element: a fixed tense or mood, a modal, obligatory (or impossible) 
sentential negation or CP-material. Examples of clausal idioms are given below: 2b and 
3b repeated as (14a-b) and additional examples in (14c-e). 


(14) a. ..can't see the forest for the trees (modal, negation) 
‘doesn’t perceive the whole situation clearly due to focusing on the details’ 
b. can't hold a candle to someone/something (modal, negation) 
‘be not as good as someone/something else’ 


8 For example, the transitive alternant may be missing idiosyncratically and sporadically in a given language 
for a few instances, but these instances have a transitive alternant with a Cause role at some other stage in 
the evolution of the same language (e.g., the recently developing transitive faint in Hebrew (i)) or in other 
languages at present (e.g., existence of the transitive fall in Hebrew (ii), but not in English). 


(i) Barur $e-hu  xavat bo dey  xazak im hu ilef oto. 
evident that-he hit — in.him rather strong if he fainted. TRANSITIVE him 


‘Tt is evident that he hit him rather strongly if he made him faint? 
https://goo.gl/GK7MWR 


(i) Dan hipil Sney sfarim. 
Dan fell.TRANsITIVE two books 
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c. butter wouldn’t melt in someone’s mouth (modal, negation) 
‘someone is acting innocent’ 


d. The squeaky wheel gets the grease. (tense) 
‘The most noticeable (loudest) ones are the most likely to get attention: 


e. not have a leg to stand on (negation) 
‘have no support (for your position)’ 


L Where does someone get off doing something? (interrogative, wh-phrase) 
"Where does someone get the right to/how dare someone do something?’ 


One may wonder at this point whether some of what we consider clausal idioms here 
would not be classified more appropriately as proverbs rather than (clausal) idioms. In- 
deed, the common though informal distinction between proverbs vs. idioms is worth 
some clarification. 

Proverbs have no precise linguistic definition. Just like our clausal idioms, they too 
are headed by some functional, rather than lexical, head. The definition we have given 
to delineate the core set of idioms, given the goals of our study, is aimed at obtaining 
evidence about lexical storage; therefore, our idioms all have properties that force them 
to be stored, and specifically stored in the grammar (not in extralinguistic storage in 
general memory). Consequently, the questions we need to ask regarding any clausal 
idiom suspected to be a proverb are: (a) Is the meaning of the expression unpredictable 
based on composition of its parts and does it involve figuration? If so it must be stored; 
(b) Is there evidence that it is stored in the storage component of the grammar, and not 
extragrammatically? The clausal idioms used in our study satisfy both of these criteria 
(on satisfaction of criterion (b), see our discussion of examples 19-22 below), thus they 
are properly falling within the set of relevant idiom data to be considered. As for whether 
some of them may be felt to be proverb-like (due to some additional, stylistic, aspectual 
or other properties) this is not a factor that effects the validity of the conclusions drawn 
based on them, as long as they meet the criteria for intra-grammatical (lexical) storage, 
as explained above.’ 

Unlike phrasal idioms, clausal idioms do occur as unique to the verbal passive. Exam- 
ples are given in (15-16) for English and (17-18) for Hebrew. As mentioned in $1, it is 


? Observe that there is a difference between the various (fixed) clausal expressions in terms of the pres- 
ence/absence of figuration they manifest. Expressions such as (i) are fixed in form and are felt to be proverbs 
(as pointed out by an anonymous referee), but involve no figuration and hence are not classified as idioms 
according to our criteria; in contrast the expressions in (ii) do manifest figuration and constitute idioms 
under our definition. At the same time, both (i) and (ii) may be felt to be proverbs. This intuitive notion 
does not seem to be associated with figuration. A property that does appear to play a role in the perception 
of a fixed clausal expression as a proverb is that it applies to a generic, rather than episodic, situation. (This 
property is orthogonal to qualifying as an idiom.) 


G) 


a. Two wrongs don’t make a right. 
b. When the going gets tough, the tough get going. 


(ii) a. A stitch in time saves nine. 


= 


A chain is only as strong as its weakest link. 
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often difficult to decide whether a certain idiom variant exists or only could exist, but 
not documented, and constitutes an ad hoc “playful” intended distortion, alluding to an 
existing idiom. Our data therefore are based on idiom dictionaries and the diathesis/es 
that they list the idioms in. In addition, however, we have googled idioms to check their 
existence in root-mate variants. We did not consider isolated occurrences, including 
playful distortions, which mostly appear in specific styles, such as media language, as 
evidence of existence. 


(15) a. might/may as well be hung/hanged for a sheep as (for) a lamb (modal) 
‘may as well commit a larger transgression, as the same punishment will 
result’ 


b. #(They) might/may as well hang someone for a sheep as (for) a lamb." 


(16) a. Gardens are not made by sitting in the shade. (negation, tense) 
‘Nothing is achieved without effort’ 


b. #One doesn’t make gardens by sitting in the shade. 


(17 a. Nigzezu maxlafot-av. (tense) 
sheared.VPassivE hair-his 


‘lost one's power/influence: 


b. #gazezu et maxlafot-av. 
sheared.TRANSITIVE.IMPERSONAL ACCUSATIVE hair-his 


(18 a. Hutla ha-kubiya. (tense) 
cast. VPASSIVE the-die 


"Ihe process is past the point of return? 


b. #Hetilu et ha-kubiya. 
cast. TRANSITIVE.IMPERSONAL ACCUSATIVE the-die 


Thus, while there are no unique phrasal idioms in the verbal passive, there appear 
to exist clausal idioms unique to the verbal passive. This provides additional evidence 
against an inheritance-based account of the lack of phrasal idioms unique to the verbal 
passive, that is, against the proposal that it is the necessary existence of a transitive al- 
ternant for all verbal passives that is inherited by (or transmitted to) the corresponding 
phrasal idioms. Initial evidence that such an inheritance-type account is not on the right 
track was presented in §2 based on inspection of the transitive-unaccusative alterna- 
tion. This alternation, as we noted, manifests regularity at the verb level, but pervasive 
uniqueness at the idiom level. Its behavior thus is incompatible with the idea that there 
is inheritance of properties from the verb level to the idiom level. Our findings regarding 
the existence of clausal idioms unique to the verbal passive, exemplified in (15)-(18), are 
also incompatible with such an inheritance-based account. If it was indeed merely inher- 
itance by the verbal passive idiom of the non-uniqueness property of the verbal diathesis 


10 A reviewer called our attention to the existence of occurrences of the idiom in the unaccusative. One online 
dictionary (out of eight) listed the clausal idiom in the unaccusative form, not in the verbal passive: One 
may/might as well hang for a sheep as a lamb. 
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(i.e., necessary existence of a transitive alternant), then there does not seem to be any 
reason why phrasal idioms would inherit “non-uniqueness”, while clausal idioms in the 
verbal passive would not do so. So not only is there no inheritance of distribution from 
the verb level to the idiom level, as shown by the transitive-unaccusative alternation, but 
in addition, an inheritance-based account could not explain the distributional distinction 
between phrasal versus clausal idioms regarding the verbal passive. Note also that the 
discrepancy between phrasal and clausal idioms with regard to uniqueness in the verbal 
passive seems to hold across languages, yet it certainly cannot be attributed to general 
cognitive constraints or functional needs of the constructions. If all the theory has at its 
disposal is inheritance networks, cognitive constraints, and functional needs to explain 
generalizations exhibited by members in the construct-i-con, the above findings cannot 
be accounted for. 

One could perhaps suggest that unlike phrasal idioms, clausal idioms are stored extra- 
grammatically, outside the construct-i-con (similar to memorized language material such 
as lines of poems, etc.), and therefore they do not inherit the non-uniqueness property 
from the verbal passive. Such a line of explanation however does not seem to be tenable. 
Unlike memorized language material, clausal idioms interact with the grammar and it is 
thus hardly plausible that their storage is extra-grammatical. First, they can appear as 
embedded clauses within various matrix contexts (19a—b). Further, they need not be full 
clauses and can include a non-idiomatic argument (20a-b). Moreover, the non-idiomatic 
element can occur within a sub-constituent (21a-b). Finally, they can include variable 
pronouns obligatorily bound by a non-idiomatic noun phrase (22a-b), (the variable pro- 
noun indicated by one has to be bound by the subject in 22a and 22b). 


(19) a. One should take into account the fact that [the squeaky wheel gets the 
grease]. (tense) 
*One should take into account the fact that [the most noticeable (loudest) 
ones are the most likely to get attention]: 

b. They had to realize that [the leopard does not change his spots]. (negation) 

"Ihey had to realize that [one remains as one is even if one pretends 
otherwise/tries hard]: 

(20) a. can't see the forest for the trees (modal, negation) 
‘doesn’t perceive the whole situation clearly due to focusing on the details’ 


b. wouldn't touch someone/something with a ten-foot pole ` (modal, negation) 
"wouldn't have anything to do with someone/something' 
(21 a. wouldn't put it [past someone] (modal, negation) 
‘consider it possible that someone might do something wrong or unpleasant’ 
b. butter wouldn't melt in [someone's mouth] (modal,negation) 
‘someone is acting innocent’ 
(22) a. can’t fight one's way out of a paper bag (modal, negation) 
‘be an extremely inept’ 


b. would give one’s right arm (for...) (modal) 
‘would like something very much’ 
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Below we turn to an additional distinction between phrasal and clausal idioms in order 
to reinforce our conclusion thus far. 


4 Diathesis sharing vs. rigidity 


In both English and Hebrew, phrasal idioms can be common to, i.e., shared between, 
root-alternants. The verbal passive always shares its idiomatic meaning with the cor- 
responding transitive (e.g., 23), as discussed in §2. Moreover, the other diatheses (the 
transitive, unaccusative, and adjectival passive), which appear in unique idioms, can 
also share their idiomatic meaning with their root-alternants (24-25).!! 


(23) a. spill the beans (transitive) 
‘divulge the secret’ 
b. The beans were spilled. (verbal passive) 
(24) a. burst someone's bubble (transitive) 
‘destroy someone’s illusion’ 
b. someone’s bubble burst (unaccusative) 
(25) a. carve something in stone (transitive) 


‘fix some idea/agreement permanently’ 


b. carved in stone (adjectival passive) 


In contrast, the clausal idioms in our preliminary investigation, unlike the phrasal 
ones, fail to exhibit sharing across diatheses. Clausal idioms seem to be unique, as illus- 
trated by examples (26-29) below. 


Transitive vs. verbal passive 


(26) a. can’t see the forest for the trees (modal, negation) 
‘doesn’t perceive the whole situation clearly due to focusing on the details’ 


b. #The forest can't be seen for the Dees H 


H We have conducted two surveys of shared idioms. The results are as follows. The number of English 
transitive predicates (out of the sample of 60) sharing phrasal idioms with the verbal passive: 35, with 
unaccusative: 17, and with adjectival passive: 21. The number of Hebrew transitive predicates (out of the 
sample of 60) sharing phrasal idioms with the verbal passive: 10, with unaccusative 16, and with adjectival 
passive: 5. Note that while phrasal idioms in the verbal passive always have a transitive version; it is not 
the case that any transitive idiom has a corresponding verbal passive idiom, as discussed in §5. 

12 This idiom does have occurrences in the verbal passive (found by Google searches). However, the idiom 
shows signs of being in the process of developing a phrasal version. This process is indicated by the 
existence of a large number of occurrences of this idiom in a phrasal version headed by a variety of lexical 
verbs, each yielding the same meaning as the original clausal idiom: ignore the forest for the trees, miss the 
forest for the trees, neglect the forest for the trees. The evolving use of this idiom in a phrasal form may be 
the reason for the occurrences of a verbal passive version. See also fn. 17. 
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Transitive vs. unaccusative (in the adjunct clause) 


(27 a. You can’t make an omelet without breaking a few eggs. (modal, negation) 
‘It is difficult to achieve something important without causing any 
unpleasant effects: 


b. #You can't make an omelet without a few eggs breaking. 
Adjectival passive vs. transitive 


(28) a. The road to hell is paved with good intentions. (tense) 
‘People often mean well but do bad things: 


b. #Good intentions pave the road to hell. 
Unaccusative vs. transitive 


(29) a. do(es) not grow on trees (auxiliary, negation) 
‘is not abundant, not to be wasted’ 


b. #do(es) not grow something on trees 


One might think at this point that the emerging lack of cross-diathesis flexibility of 
clausal idioms could be due to the fact that in English the relevant diathesis alternations 
involve syntactic movements reordering subparts of the idiom. These movements might 
be suspected to be incompatible with the idiomatic reading for reasons of information 
structure, independent of the diathesis change itself. However, examining clausal idioms 
with regard to parallel diathesis alternations in Hebrew, a language in which diathesis 
alternations do not have to involve such potentially interfering factors, seems to point 
in the same direction. For instance, the Hebrew clausal idiom in 30a does not require 
reordering (nor addition of words), when undergoing the diathesis alternation in 30b; 
still the latter is impossible. 


(30) a. kSe-xotvim ecim, nitazim $vavim. 
when-chop.TRANSITIVE.IMPERSONAL trees, sprinkle.UNACCUSATIVE chips 
(tense) 


"When you act, there are risks’ “Where trees are felled chips will fly’ 


b. £k$e-xotvim ecim, metizim 
when-chop.TRANSITIVE.IMPERSONAL trees, sprinkle.TRANSITIVE.IMPERSONAL 
$vavim. 
chips 


If knowledge of language were nothing more than an inventory of constructions 
whose properties derive from cognitive constraints, functional needs and inheritance 
hierarchies, there would be no way to explain why the clausal idioms we have examined 
(full and partial sentential structures) are unique to their diathesis, while phrasal idioms 
are commonly shared across diatheses. 

In sum, an inheritance-based account cannot explain why idioms headed by members 
of the unaccusative alternation show pervasive uniqueness at the idiom level, although 


482 


22 “Constructions” and grammar: Evidence from idioms 


the verbal alternation is rather systematic. Moreover, under such an account, it is com- 
pletely unclear why clausal idioms can be unique to the verbal passive as well as to other 
diatheses, and even seem to be unique generally, while phrasal idioms cannot be unique 
to the verbal passive, but can be unique to other diatheses. 

Below we consider what an alternative approach, one that can provide a principled 
account for the above generalizations, should look like, and sketch our proposal in terms 
of idiom storage, which derives these findings. 


5 Alternative, derivational accounts 


CxG imposes no principled limitation on lexically stored syntactic objects and assumes 
no syntactic (online) derivation, only stored objects (“constructions”), whose interrela- 
tions are expressed via inheritance networks. The inability of CxG to capture the distri- 
butional asymmetries of diatheses in idioms established in the preceding sections is a 
direct consequence of these fundamental characteristics of the model. We believe that 
in contrast to the CxG model, modular derivation-based theories, namely theories in- 
corporating a fundamental distinction between lexically stored entities versus syntactic 
objects derived by the computational system of grammar have the potential to provide 
an adequate account for the above findings. Before sketching the particular account 
that we propose, observe what assumptions are available in derivation-based modular 
architectures — and absent in non-derivational, construction-based models - that seem 
prerequisites for accounts aiming to capture the diathesis asymmetries discussed above. 

What seems crucial for conceiving a syntactic account is the incremental building 
of structure in the syntactic derivation, yielding units in the course of the derivation 
(“phases”) that impose locality limitations on the accessibility of special/idiomatic mean- 
ings. As for lexical accounts (involving the storage component of grammar) what is 
crucial would be principled constraints on what can be stored in the lexicon and in what 
manner, as will be explained in what follows. 

In the remainder of this section, we sketch an account along the latter lines within 
our model of Type Sensitive Storage (TSS) (Horvath & Siloni 2016). The model derives 
the diathesis asymmetries discussed in the previous sections from a different storage 
technique motivated for phrasal versus clausal idioms. Under this proposal, the distinct 
storage technique of phrasal versus clausal idioms is a direct consequence of their having 
a lexical versus a functional head, respectively. Each storage strategy, in turn, results in 
a different pattern of distribution across diatheses. As summarized in (31), the Type- 
Sensitive Storage model suggests that phrasal idioms are stored as subentries of existing 
lexical entries, whereas clausal idioms constitute independent lexical entries on their 
own, that is, are not stored as subentries. 


(31) The Type-Sensitive Storage (TSS) Model 


a. Idioms are stored as part of our linguistic knowledge (not as general, 
non-linguistic information). 
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b. Phrasal idioms - Subentry Storage: Phrasal idioms are stored as subentries of 
the lexical entry of their head (and possibly of their other constituents).? 


c. Clausal Idioms - Independent Storage: Clausal idioms are stored as 
independent entries on their own. 


Let us see how this would account for the findings. Subentry storage is contingent 
upon the listing, i.e., the existence, of the (mother) entry in the lexicon. The verbal 
passive is formed beyond the storage component, the lexicon (Baker, Johnson & Roberts 
1989, Collins 2005, Horvath & Siloni 2008, Meltzer 2012, among others). It follows that 
the verbal passive is not stored; it is not a lexical entry. Hence, the verbal passive cannot 
have subentries. Thus, under 31b, phrasal idioms cannot be unique to the verbal passive 
because such idioms cannot be stored. Phrasal idioms in the verbal passive can only be 
formed by passivization of their transitive counterparts. Hence, they always share their 
idiomatic meanings with the corresponding transitive. The transitive, unaccusative and 
adjectival passive, in contrast, are formed in the lexicon (Horvath & Siloni 2008; 2011, 
Reinhart 2002), and stored there; therefore, they can have subentries. 

It should be observed that unlike the existence of a transitive (active) version for every 
verbal passive phrasal idiom, we, correctly, do not predict the automatic existence of a 
verbal passive version for every transitive idiom. Since verbal passives are derived in 
the syntax, the question determining whether or not a transitive idiom will exist in the 
verbal passive depends on whether the idiom is able to undergo the syntactic operation of 
passivization resulting in a well-formed output. This in turn involves interpretive factors, 
such as whether the idiom chunk to become the derived subject of the passivized idiom 
has the appropriate semantic properties, e.g., referentiality, to be compatible with the 
information structure consequences of being in subject position. Hence, the contrast 
between The beans were spilled vs. "The bucket was kicked (see for instance Nunberg, 
Wasow & Sag 1994, Ruwet 1991, Punske & Stone 2014 on what factors may determine 
whether or not a verbal passive version of a transitive idiom is possible). 

Clausal idioms in contrast are stored as independent entries. Let us first motivate 
this claim. The head of clausal idioms is a functional, not a lexical, element. Functional 
elements unlike lexical ones are closed class items, have no descriptive content (Abney 
1987), and bear no thematic relation to their complement. Functional elements have often 
been argued to be stored in a separate lexicon, e.g., Emonds’ 2000 “Syntacticon’, or as “f- 
morphemes" (Distributed Morphology). One storage option for clausal idioms would be 
storage as subentries of their functional head. This would, for instance, mean storage of 
the idiom not have a leg to stand on as a subentry of its functional head, Neg. This would 
be storage of entities that have descriptive content in the "functional lexicon", where 
entries do not have descriptive content. This seems to us incoherent. We therefore do 
not pursue this option. 

Two additional options come to mind: (i) Clausal idioms are stored as subentries of 
the lexical head of the "extended projection" (in Grimshaw's 1991 terms) constituting 


13 The question as to whether they are also stored as subentries of the lexical entries of their other constituents 
is important but irrelevant for our purposes here. We therefore abstract away from it here. 
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the clausal idiom, namely, storage under the verb on a par with VP idioms. (ii) Clausal 
idioms are independent entries on their own; they are not stored as subentries of another 
lexical entry 31c. Subentry storage under the lexical head (option (i)) predicts the absence 
of unique clausal idioms in the verbal passive (just like in the case of phrasal idioms); 
this is contradicted by our findings (e.g. 15-18). 

Independent storage 31c predicts occurrence of unique clausal idioms in the verbal 
passive, in concert with our findings. Under independent storage, clausal idioms get lex- 
icalized in one piece (following consistent use of the expression in the relevant contexts). 
Clausal idioms thus do not require that their subconstituents be represented as entries 
in the lexicon. They get stored as a whole and can therefore include any diathesis (or 
any other syntactic output). Hence, there should be clausal idioms unique to the verbal 
passive. There are thus reasons to adopt the independent storage strategy for clausal 
idioms.” 

If phrasal idioms were stored as independent constructions, on a par with clausal id- 
ioms, there would be no reason why they could not be unique to the verbal passive. Pre- 
cisely because phrasal idioms are not stored constructions (contra CxG’s assumptions), 
they cannot be unique to the verbal passive. 

The difference in storage that the two types of idioms employ can also explain the 
second distinction revealed between them. While phrasal idioms commonly share id- 
iomatic meanings across root-counterparts, clausal idioms tend not to be shared across 
diatheses (§4). As already discussed above, a verbal passive phrasal idiom must share its 
idiomatic meaning with the corresponding transitive because it is formed by syntactic 
passivization of the latter (it is not stored). Further, under the TSS model, sharing of 
phrasal idioms between the transitive and its (lexically derived) unaccusative or adjecti- 
val passive alternants is the result of the links between root-related entries in the lexicon, 
which can induce spread of special meanings and idiomatic expressions between the en- 
tries. Sharing is not automatic though, as it requires additional listing under the entry 
of the relevant alternant; hence there are also unique phrasal idioms in these diatheses, 
as discussed in §2. 

In contrast, under 31c, clausal idioms are stored as independent entries, not as suben- 
tries of other entries that may be linked to root-mates. The model therefore predicts 
that nothing would induce sharing of idiomatic meaning between the transitive and its 
unaccusative or adjectival passive alternants; such sharing thus should be unattested or 
rare, as our preliminary results show.” 

Verbal passives, unlike the transitive, unaccusative, and adjectival passive, are derived 
in the syntax. So there is no a priori reason not to expect the application of this operation 
to (some) transitive clausal idioms. If that occurred, at least some clausal idioms would 


14 Interesting questions arise with regard to the storage of idioms with no recognizable internal structure 
(e.g., trip the light fantastic), as well as idioms (arguably) headed by a non-sentential functional element 
(a light verb, a functional preposition, or a conjunction, as in take a shower, in a rut, and cut and dried, 
respectively). These important questions are beyond the scope of this paper and are not directly relevant 
for the issue of cross-diatheses distribution. 

15 A priori, nothing rules out the independent development and storage of a clausal idiom in a root-related 
diathesis. However, we predict this to be very rare (if at all attested) as nothing induces this. 
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be available in the verbal passive. If no sharing occurs in the case of clausal idioms, 
it could be due to the inaccessibility of clausal idioms to internal syntactic operations, 
resulting from their being lexical entries inserted into the syntax as single one-piece 
units.” 


6 Conclusion 


We have distinguished between two different types of idioms — phrasal idioms vs. clausal 
idioms - and investigated their cross-diathesis distribution. Phrasal idioms distribute dif- 
ferently in the verbal passive vs. other diatheses: they cannot be specific (unique) to the 
former but can be specific to the latter. Clausal idioms do not seem to discriminate be- 
tween diatheses in this way: They seem specific to a single diathesis. These systematic 
distinctions show that even the properties of idioms, the archetypal “construction” a la 
CxG, require more than cognitive principles, reference to functional needs, and inher- 
itance networks of stored entities ('constructions") to be accounted for. An adequate 
theory of idioms must have recourse to a distinction between stored items and unstored 
derivational outputs, and to grammatical distinctions such as those between diatheses, 
and those between functional versus lexical elements. We sketched an account of the 
above findings, distinguishing between diatheses according to where they are formed, 
and storing idioms according to the type of element heading them (lexical or functional). 
Thus, the domain of idioms (surprisingly, from the CxG point of view) turns out to re- 
inforce the conclusion that there must be more to knowledge of language than a hierar- 
chical inventory of items and extra-grammatical constraints. 
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16 Whether or not syntactic passivization would apply to a particular clausal idiom would depend on whether 
the idiom has the semantic properties compatible with the changes in information structure induced by 
passivization (as mentioned above concerning phrasal idioms). 

17 A Google search reveals that the verbal passive version of the idiom in (i) does have some occurrences, 
though substantially fewer than the transitive form. 


(i) could've knocked me over with a feather 
I was extremely surprised, astonished’ 
DU (el could've been knocked over with a feather. 
The question is whether or not these occurrences indeed are clausal idioms at all. This cannot be un- 
equivocally determined because along with the clausal idiom (i), this idiom turns out to have also a phrasal 


transitive version: 'knock (someone) over with a feather' (listed in this form, with no fixed tense, no modal 
(and no fixed subject or object) (see the online Free Dictionary https://goo.gl/cv7RIT). See also fn. 12. 
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A-morphous Morphology is a morpheme-less theory of word-internal structure (Anderson 
1992). Under this approach, derivational patterns are analyzed using Word Formation (re- 
dundancy) Rules. By specifying systematic relations among the words of a language, Word 
Formation Rules generally describe, rather than derive, the structure of complex words. 
Here, on the basis of data from American Sign Language, we present a complementary 
view of lexical iconicity. We suggest that in the discussion of iconicity and of morphologi- 
cal structure alike, a distinction can be made between those signs whose internal structure 
has been eroded away, and those signs whose motivated internal structure is analyzable as 
part of a systematic pattern. 


1 Introduction 


Stephen R. Anderson’s A-morphous Morphology characterizes morphology as a form of 
linguistic knowledge (Anderson 1992: 181). This characterization is a response to re- 
silient misconceptions about word structure and the lexicon: morphology is traditionally 
thought to primarily involve an inventory of minimally meaningful forms and the gen- 
eral mechanisms through which these meaningful forms are combined to make complex 
words. This procedural view of morphology is in turn often justified by reference to de 
Saussure’s definition of the linguistic sign. However, Anderson (1985; 1992; In press) has 
shown that, in contrast to the “exaggeratedly minimal” (Anderson 1992: 326) analysis of 
complex words as composed incrementally from independently meaningful pieces, the 
Saussurean sign is a holistic, conventional relation “between a possibly complex form 
and its possibly complex meaning” (Anderson 1992: 193). 

Treating morphology as the knowledge that speakers have about holistic relationships 
between complex word forms and their complex meanings leads to a quite different con- 
ceptualization of the lexicon. Rather than merely a list of minimally meaningful forms, a 
speaker’s lexical knowledge must also comprise systematic relations between and among 
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the whole words of their language. Anderson proposes that these systematic relation- 
ships can be formalized using what are referred to as Word Formation (redundancy) Rules 
(after Jackendoff 1975). As a formal representation of patterns of similarity and differ- 
ence among related words, Word Formation Rules are “only superficially” a process by 
which new words are actively created or procedurally derived; their primary job is to 
codify systematic correspondences between words as an aspect of any speaker’s linguis- 
tic knowledge (Anderson 1992: 186). This perspective is motivated by the treatment of 
syntax as the knowledge that speakers have about how words are organized into sen- 
tences and of phonology as the knowledge that speakers have about how sounds are 
organized into words in linguistic theory. 

In this chapter, we demonstrate that Anderson’s "a-morphous" view of morphological 
structure provides a template for the study of iconic motivation in sign language struc- 
ture, as well. Characterizing morphology as the knowledge that speakers have about the 
relationships between word forms and their meanings leads to a quite different concep- 
tualization of iconicity, the perception of a motivated link between word forms and their 
meanings. 

Iconicity has traditionally posed a challenge to the field of sign language linguistics. 
Because the linguistic sign relation is commonly characterized as an arbitrary pairing of 
a word form and its meaning, the obvious links between sign forms and their meanings 
originally presented an obstacle to the recognition of sign languages as natural human 
languages. A way around this obstacle was to argue that lexical signs are essentially arbi- 
trary, despite their apparent iconicity, and moreover that signs can be shown to consist 
of smaller meaningless formative units. This view casts iconicity aside as etymological 
residue that is irrelevant for the understanding of recurring structural patterns in sign 
languages. 

Our claim is that, like morphology, iconicity is an aspect of linguistic knowledge. Our 
perspective follows Anderson's (1992) key observation about the nature of synchroni- 
cally analyzable morphological structure: While responsible for the formation and analy- 
sis of new words, derivational morphology is not typically actively engaged in the deriva- 
tion of established words from smaller meaningful components. Instead, the perception 
of transparent word-internal morphological structure is a reflection of the knowledge 
that speakers have about the relationships between whole words and their analogous 
constituent parts. Here, we demonstrate that this approach can also account for several 
morphological patterns in American Sign Language (ASL) in which a motivated, iconic 
link between meaning and form serves as the organizing principle. 

Our analysis of sign-internal structure in ASL builds from Anderson's treatment of 
Word Formation Rules as formal representations of patterns of similarity and difference 
among related whole words. When we consider whole words to be derivationally related 
to one another, partial relations among related words can be captured with a general 
rule, without expecting that whole words should exhibit incremental, semantically com- 
positional morphological structure (see also Ackerman, Malouf & Blevins 2016, Aronoff 
1976, Bochner 1993, Hay & Baayen 2005, Aronoff 2007, Blevins 2016, Anderson In press). 
Under this view, whole words are the primary unit of morphological organization. Re- 
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lated whole words with transparent, analyzable internal structure participate in morpho- 
logical patterns that can be described by a structural rule. However, individual words 
can become quite reduced and opaque over time, such that they eventually lose their 
synchronic morphological connection to other words in the language. Taken seriously, 
Anderson’s view leads to the conclusion that analyzable word-internal structure is most 
often a gradient reflection of etymological history, and only infrequently the derived 
output of a synchronic operation. 


2 The erosion of transparency in lexical signs 


The field of sign language linguistics has been compelled to demonstrate that, despite 
their apparent semantic and gestural transparency, signs are arbitrary linguistic symbols 
that are analyzable into smaller formal units (see Stokoe 1960, Klima & Bellugi 1979, and 
Supalla 1986 for examples). Underlying this work is the assumption that conventional 
linguistic symbols are, by definition, inherently arbitrary. Accordingly, if they are truly 
linguistic in nature, lexical signs should also be arbitrary symbols, even if they were once 
iconically motivated (however, see Wilcox & Wilcox 1995, Taub 2001, Perniss, Thomp- 
son & Vigliocco 2010, and Emmorey 2014 for critical reviews of this assumption). This 
perspective leads to the conclusion that iconicity is a secondary, etymological feature 
of individual signs, and ultimately erodes over time. For example, Frishberg (1975: 718) 
compares old (ca. 1918) and modern (ca. 1965) versions of several ASL signs, and argues 
that over time, “in general, signs have become less transparent, pantomimic, and iconic; 
they have become more arbitrary, conventionalized, and symbolic? 

Accordingly, when comparing old and modern versions of signs like cow and HORSE, 
which are both articulated at the signer’s head, and are motivated by an image of the 
animal’s horns and ears, respectively, we can appreciate that the older forms are more 
faithful to their original motivating image, while the newer forms have lost some of their 
original iconicity: The older form of cow is signed with two hands, one for each of the 
paired horns to be represented Figure 1a, while the newer, more typical form is signed 
with only one hand Figure 1b. This change over time has an articulatory motivation. It 
requires less effort to move one hand than it does to move both hands, and, because the 
second hand is configured identically to the dominant hand in these cases, the absence 
of the second hand does not hinder recognition of the target sign. As a result, the in- 
volvement of the second, non-dominant hand has been deleted from these signs in the 
course of history (Battison 1974, Frishberg 1975). 

As another example, Napoli, Sanders & Wright (2014: 437-438) demonstrate that syn- 
chronically, in casual signing, the form of the iconic sign HOUR is often altered to make 
the sign less difficult to articulate, which can result in the formation of a less iconic sign. 
In the citation form, the iconic sign HOUR is articulated with a dominant index finger 
tracing a full circle around the palm of the non-dominant hand. The iconic motivation 
for this sign is the movement of a minute hand around the face of a clock, with the non- 
dominant hand representing the face of a clock, and the dominant hand representing 
the angle and movement of the minute hand. In the citation form, the wrist serves as a 
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a. (beginning of sign) (end of sign) 
ü 


b. (beginning of sign) (end of sign) 


Figure 1: The ASL sign cow signed (a) with two hands and (b) with one hand. 


hinge for the circular movement of the dominant hand (Figure 2a). However, in the more 
casual form of HOUR, the locus of the dominant band e movement is transferred away 
from the wrist to the elbow and shoulder (Figure 2b). This change partially disrupts the 
iconic image of a clock hand tracing a journey around the clock face, as it is the whole 
hand, rather than the extended finger alone, which traces a circular movement. This 
change obscures the iconic representation of a clock's minute hand in the sign HOUR, 
and again, this change is favored for an articulatory reason, as it avoids “a physiologi- 
cally awkward movement" (Napoli, Sanders & Wright 2014: 438). The logical conclusion, 
based on examples like cow and Hour, is that in time, true, systematic processes work 
to erode the coincidental, iconic origins of any sign. 

We contend that these discussions about erosion of iconicity require more nuance. 
The cases cited above are indeed instances of signs reducing in ways that partially ob- 
scure their original motivating visual image. In these examples, the iconic motivation for 
a single sign is overcome by articulatory considerations. We assume that the primary 
constraint on this phonetic reduction is that the overall form of the sign itself should 
nevertheless remain recognizable as "the same sign": Processes of phonetic reduction 
can erode the forms of signs only once they have been registered as conventional lexical 
items with conventional forms and agreed-upon meanings, to begin with. However, we 
note that even in the face of phonetic reduction, the reduced versions of the signs cow or 
HOUR actually remain quite faithful to their iconic motivations. Both signs still transpar- 
ently represent the horn of an animal and the face and hand of a clock, respectively. In 
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ER 


(beginning of sign) (middle of sign) (end of sign) 


ITE 


(beginning of sign) (middle of sign) (end of sign) 


Figure 2: The ASL sign Hour signed (a) with the locus of rotation at the wrist 
and (b) with the locus of rotation at the elbow. 


these cases, at least, the shift is not from “wholly iconic sign" to “wholly arbitrary sign; 
but rather from “more transparent conventional sign" to “less transparent conventional 
sign.” 

Here it is important to note that this sort of gradient phonetic erosion also affects mor- 
phological transparency in conventional signs. Like spoken words! and like iconic signs, 
morphologically complex signs, once registered as conventional pairings of meaning 
and form, may begin to drift in ways that obscure their original etymology. An example 
discussed by Frishberg (1975: 707) is the ASL sign HOME. The conventional sign HOME de- 
rives etymologically from the composition of the signs EAT and sLEEP (this combination 
can be glossed as EAT+SLEEP). These signs were almost certainly selected to represent the 
concept ‘home’ because a “home” is “where one eats and sleeps.” However, as a function 
of its lexical entrenchment as a conventional sign, EAT+SLEEP has drifted both in form 
and in meaning. It has been reanalyzed as a semantically holistic sign meaning ‘home’, 
and has reduced in form so as to mask its former transparent relationship to its original 
constituent signs. As a result of this drift over time, the sign HOME no longer bears an 


! A anonymous reviewer rightly comments that this erosion is likely also modulated by frequency. In English, 
the classic example cupboard has undergone assimilation and reduction that obscures its connection to its 
original constituent words, while other (newer/less frequent) words like clipboard retain their original 
compound pronunciation (see Zipf 1935, Bybee 2001). The same reviewer eloquently notes that written 
alphabetical systems have also developed through this type of “creeping opacity,’ in which the written 
symbols became streamlined and less connected to their “original causal denotata.” 
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overt morphological relationship to its former constituent signs EAT and SLEEP in modern 
ASL. 

A related, synchronic example is the sign STUDENT, which is etymologically derived 
from the composition of the signs LEARN (whose form is iconically motivated by the im- 
age of moving an object into the mind) and person (whose form is iconically motivated 
by the silhouette of a human figure). While the citation form for STUDENT still retains 
much of its analyzable internal structure as a composite of LEARN+PERSON (Figure 3a), 
in casual signing, STUDENT is typically reduced to the point that its analyzable morpho- 
logical structure is no longer identifiable (Figure 3b). 


at a 


a. (beginning of sign) (middle of sign) (end of sign) 


ER 


b. (beginning of sign) (middle of sign) (end of sign) 


Figure 3: The ASL sign sTUDENT (a) in a fuller, more transparent form 
(“LEARN+PERSON’) and (b) in a reduced, more opaque form (“STUDENT”). 


Similar reduction can also be observed in the casual forms of the related signs INTER- 
PRETER and TEACHER. Like the sign sTUDENT, these signs are morphologically complex, 
and they can be analyzed as previously derived from INTERPRET+PERSON and TEACH+ 
PERSON. These signs participate in a productive derivational pattern in ASL involving 
the addition of PERSON as an “agentive suffix.” However, as frequently occurring signs, 
INTERPRETER, STUDENT, and TEACHER have all drifted in ways that render their morpho- 
logical structure increasingly opaque in casual signing. We discuss some implications of 
this erosion (and possible reanalysis) of transparent morphological structure in §3. 

As in (spoken and signed) morphology (see Bybee 2006), the gradual loss of iconicity 
is not an across-the-board phenomenon. Iconicity can also persist within signs when it 
becomes morphologized, or made systematic as a learned, language-internal pattern (see 
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Anderson 1992: 337). The loss of iconicity is therefore not as inevitable as is commonly 
believed. An example of a sign which might be considered to have lost its iconicity (an 
analysis that has been debunked by Wilcox & Wilcox 1995: 153, Taub 2001: 228, and 
Wilcox 2004: 123) is vERY-sLOW. The sign sLow is articulated with the dominant hand 
sliding over the back of the non-dominant hand in a single movement (Figure 4a). The 
slow movement of the hand can be considered iconically motivated, as the friction result- 
ing from the contact between the two hands causes the sign to be articulated somewhat 
slowly. In the derived sign veRy-sLow, however, the movement pattern has changed: 
VERY-SLOW is articulated with a short initial hold, followed by a quick, larger burst of 
movement (Figure 4b). We will demonstrate that this change in movement is character- 
istic of an "intensive" derivational pattern in ASL, but these facts originally led Klima & 
Bellugi (1979: 30), for example, to conclude that the iconicity of stow has been “overrid- 
den and submerged" in the formation of the sign vERY-sLOW, as it is signed with a very 


fast movement. 


(beginning of sign) (end of sign) 


ae 


(beginning of sign) (end of sign) 


Figure 4: The forms of the ASL signs (a) sLow and (b) vERY-srow differ primar- 
ily in their speed and size: vERY-srow is signed with a faster, larger movement. 


While the movement of the sign vERY-sLow is indeed quite fast, the process that de- 
rives the intensive version of sLow by changing its original form to incorporate a quick 
burst of movement is at once iconically motivated and systemically motivated: it is also 
at work in the formation of a number of other ASL signs. These signs include predicate 
adjectives like vERY-CLEVER, VERY-EXPENSIVE, and VERY-STUBBORN, and they all have 
in common that they derive the intensive form of a sign by increasing the intensity of 
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its conventional movement. Accordingly, it is not the case that vERY-sLOw has lost its 
iconicity. Instead, vERY-sLow has taken on a different type of iconic motivation, one that 
happens to be at odds with the idea that the only way to represent “incredible slowness” 
is to use a very slow movement (Taub 2001: 229). In this intensive derivational pattern 
in ASL, the intensity of a sign’s movement is increased, thereby iconically signaling an 
increase in the intensity of the sign’s meaning (Wilcox & Wilcox 1995: 153). 

This systematic iconic correspondence between whole words is precisely the kind of 
relationship that can be described using a Word Formation Rule. The fact that aspects 
of the pattern happen to be iconically motivated in no way precludes this rule from 
having been taken up and made systematic in ASL. Indeed, this discussion of the loss 
of iconicity in individual signs, and the preservation of iconicity when it is relevant for 
language-internal structure, is entirely compatible with Anderson’s a-morphous view of 
morphology, in which conventional words are Saussurean signs, regardless of whether 
they contain transparent, analyzable structure. Though they are often referred to as 
“lexical entries; Saussurean signs cannot be considered entries on a structureless list. 
Instead, a language user’s morphological knowledge also encompasses their knowledge 
of the relationships between the established words of their language.” 

Word Formation Rules are descriptions of the phonological, syntactic, and semantic 
differences and correspondences between two or more morphologically related forms. 
For example, the rule that describes the relationship between pairs of English words like 
breath and breathe, loss and lose, and grief and grieve (minimally) specifies a change in 
word-final voicing, a change in syntactic category, and concomitant changes in meaning. 
However, as in the analysis of non-concatenative morphology in spoken languages, the 
formal representation of phonological changes in sign language morphology has the 
potential to obfuscate more than to clarify. This problem is also compounded by the fact 
that there is no dominant conventional system for describing sign forms on analogy to 
the International Phonetic Alphabet for spoken language research. 

In order to discuss Word Formation Rules in ASL, we require a representational sys- 
tem that will allow us to recognize that signs are holistic pairings of complex form and 
complex meaning. The convention of labeling ASL signs with English metalinguistic 
glosses is, by itself, inadequate for this task. Labeling signs with English glosses illus- 
trates that they have conventional, holistic meanings, and implies that they similarly 
have conventional, agreed-upon forms. In order to facilitate an analysis of the iconic 
structure within ASL signs, we will adopt Taub’s (2001) convention of listing the aspects 
of form in a sign with their corresponding aspects of meaning. As Meir and colleagues 
(Meir 2010: 874; Meir et al. 2013: 316) have demonstrated, such iconic mapping diagrams 


? An anonymous reviewer comments that Anderson (1992) presents a realizational theory of morphology, 
in which an inflected word’s semantic content precedes and determines its phonological form. This is in 
opposition to concatenative theories in which a word’s form determines its content. Our a-morphous anal- 
ysis of iconicity in ASL word formation is meant to be consistent with a realizational theory of inflection. 
We do not discuss ASL inflectional morphology here because there are several competing perspectives as 
to what should even count as morphosyntactic inflection (namely agreement) in ASL. Reviewing these 
perspectives takes us beyond the scope of this chapter, but the reader is referred to Lillo-Martin & Meier 
(2011), Wilbur (2013), and Wilcox & Occhino (2016) for a sense of these different perspectives. 
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make it clear that iconicity is neither a deterministic nor a compositional property of 
signs: a sign may be a conventional pairing of form and meaning and also exhibit trans- 
parent and motivated aspects of structure. Crucially, the perception of iconicity arises 
as a consequence of the fact that signs are conventional pairings of a potentially com- 
plex form and potentially complex meaning, and not from a compositional analysis of 
the sign’s parts.? The meaning of the whole facilitates the (re)analysis of its parts, rather 
than the other way around. 

As an illustration of an iconic mapping in ASL, consider the sign sLow, already de- 
scribed impressionistically above (and pictured in Figure 4a). This sign has a conven- 
tional form and meaning, and aspects of its form can be analyzed as transparently mo- 
tivated by its meaning. These correspondences can be represented through an explicit 
pairing of aspects of the sign’s form with aspects of its meaning, as in Table 1. 


Table 1: Aspects of the iconic mapping for sLow. 


Form Meaning 

non-dominant hand a stationary object 

back of the non-dominant hand — a surface that creates friction 
dominant hand an object in motion 

palm of the dominant hand a surface that creates friction 
contacting movement contact between two surfaces 
dragging movement a movement slowed by friction 


This representation illustrates that the conventional sign sLow exhibits analyzable 
internal structure: formational aspects of this sign can be linked to aspects of the visual 
and kinesthetic images that provide the sign's iconic motivation. For example, in this 
case, it is possible to assign an iconic aspect of meaning to each ofthe two hands, as well 
as the manner in which the dominant hand contacts the non-dominant hand. 

The benefit of the representation in Table 1 is that it allows us to discuss the rela- 
tionship between the form and meaning of the whole sign as well as the relationship 
between the whole and its parts, including how these aspects of structure may change 
from sign to sign. For example, as discussed above, the Word Formation Rule for the "in- 
tensive” pattern alters the conventional mapping for stow by changing the character of 
the base sign's movement: the sign vERY-SsLOW is formed with a movement pattern that 
is superimposed onto the form of the original sign sLow, keeping the overall trajectory 
of the movement but adding a brief initial hold followed by a quicker and larger burst 


3 A reviewer notes, and it has been pointed out previously (e.g. Fernald & Napoli 2000), that there are also 
similarities between ASL morphology and iconic, sound-symbolic elements in spoken language such as 
phon(a)esthemes and ideophones. Phonaesthemes are recurring pairings of meaning and form occurring 
in words that cannot otherwise be analyzed as exhibiting compositional morphological structure (Ander- 
son 1992: 49, Bergen 2004). Ideophones are depictive, sound-symbolic words that appear in a variety 
of languages (Dingemanse 2012). We expect that an “a-morphous” analysis of iconicity and morphology 
should also extend to these classes of words, but leave the details of this project for future work. 
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of motion. The resulting sign vERY-sLow (pictured in Figure 4) can be represented as in 
Table 2. In this representation, the aspects of form and meaning that have been changed 
by the intensive Word Formation Rule are emphasized in bold. 


Table 2: Aspects of the iconic mapping for vERY-sLOW. 


Form Meaning 

non-dominant hand a stationary object 

back of the non-dominant hand — a surface that creates friction 
dominant hand an object in motion 

flat palm of the dominant hand a surface that creates friction 
contacting movement contact between two surfaces 
brief initial hold buildup of pressure 

quick, large movement release of built-up pressure 


Following Anderson e (1992: 186) formulation of the -able Word Formation Rule, for 
example, we can think of this relationship between pairs of signs sLow and vERY-sLOW 
in the following way: The form of the intensive Word Formation Rule specifies that the 
intensive form of a sign is made by changing its movement pattern. However, rather 
than a true “derivational” rule, the Word Formation Rule is regarded as a description of 
the systematic differences between the signs represented in Table 1 and Table 2. 


3 The (re)analysis of lexical iconicity 


Iconic mappings provide a way to represent the relationship between the form and mean- 
ing of a conventional complex sign. They also provide a way to specify how aspects of a 
sign's analyzable internal structure can be reanalyzed by speakers based on the meaning 
of the complex sign that they appear in. In this section, we explore this tradeoff between 
form and meaning. We begin with the ASL sign TIME, as an illustrative example of how a 
sign's relationship to its original iconic motivation can become obscured, and even how 
the form of the conventional sign can subsequently be reanalyzed by signers. We sug- 
gest that such reanalysis can only happen in a system where the sign relation between 
form and meaning takes precedence over the compositional structure that originally 
contributed to the sign's creation.* 

The ASL sign TIME is formed with the crooked index finger of the dominant hand tap- 
ping the back of the non-dominant wrist (Figure 5). ASL signers and non-signers alike 


4 Of course, signers can, and often do, create new signs, as well. We analyze these new signs as repurposing 
the patterns and elements that recur among established signs. A popular (in both senses) article from 2015, 
for example, discusses some ASL candidates for internet slang like selfie and photobomb. These potential 
signs make creative new use of old sign parts, though we hesitate to analyze them as semantically “com- 
positional” in the traditional sense. The article is accessible online (http://www.hopesandfears.com/hopes/ 
now/internet/168477-internet-american-sign-language), as is some additional commentary from an ASL 
news “vlog” (https://www.youtube.com/watch?v-wI808zgEK88). 
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readily recognize the similarities between this sign and the act of tapping the face of a 
wristwatch, for example as part of a gesture of impatience. The sign TIME can therefore 
be considered to have a transparent iconic motivation, stemming from the cultural as- 
sociation of reading a wristwatch with the telling of time. For signers, this analysis of 
TIME's iconic motivation is also reinforced by the fact that the ASL sign WRISTWATCH is 
indeed articulated in the same location, at the back of the non-dominant wrist. 


Figure 5: The ASL sign TIME. 


However, this etymological description of the ASL sign TIME is in fact a folk reanal- 
ysis. As Shaw & Delaporte (2010: 177) explain, "the origin of TIME was identified long 
before the advent of the wristwatch in 1904" They demonstrate that as early as 1785, 
the French Sign Language sign TIME was recorded in a form similar to that of ASL, its 
daughter language, with the crooked index finger repeatedly contacting the back of the 
non-dominant hand. The image motivating the form of this historical sign is the design 
and function of an early mechanical clock that uses a hammer to strike a bell at the stroke 
of an hour. Historical texts documenting Old French Sign Language describe this sign's 
form as showing “the hammer which taps the bell” and using the index finger to “ring 
the hour on the back of the hand which is in the guise of a bell" (Ferrand 1896 and Lam- 
bert 1865, respectively, as cited by Shaw & Delaporte 2010: 177-178). Following Taub's 
(2001) conventions for analyzing iconic mappings, we can represent aspects of the map- 
ping between the phonological and semantic elements of this historical sign TIME as in 
Table 3: 


Table 3: Aspects of the historical iconic mapping for TIME. 


Form Meaning 

non-dominant hand the bell of a clock 

back of the non-dominant hand surface of the bell 

dominant hand a figure which rings the clock 
crooked index finger the hammer which strikes the bell 
contacting movement the hammer striking the bell 
repeated movement a repeated action 
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By the time the wristwatch became popular in the early 1900s, the sign TIME had been 
in use for well over a century. Having already been established as a conventional pairing 
of form and meaning, it was, presumably, no longer primarily analyzed as deriving its 
meaning from its constituent parts. Parallel to the examples of cow and HouR mentioned 
above, the formational aspects of the sign TIME began to drift slightly, such that the index 
finger moved “a few centimeters from the back of the hand to the back of the wrist” (Shaw 
& Delaporte 2010: 178). The sign TIME was also no longer concretely linked to the image 
of a particular time-telling device. Accordingly, to the extent that they were associated 
with any meaning at all, the parts of the sign TIME must have derived their meanings by 
association with the meaningful whole sign. The sign’s existing internal structure was 
thus open to reanalysis as motivated by the image of a wristwatch, as is represented in 
Table 4. Here we see that the aspects of form are the same across both Table 3 and Table 4, 
however the mapped meanings differ between the historical and modern versions of the 
sign TIME. 


Table 4: Aspects of the modern iconic mapping for TIME. 


Form Meaning 

non-dominant hand a human hand 

back of the wrist the location of a wristwatch 
dominant hand a human hand 


crooked index finger a human finger 
contacting movement a human finger contacting a wristwatch 
repeated movement a repeated action 


We re-emphasize that this iconic reanalysis could only happen because the holistic 
relation between TIME’s form and meaning takes precedence over the aspects of structure 
that originally contributed to its creation. The sign TIME provides a very nice example, 
but it is not an exceptional case: all conventional signs in ASL are by definition registered 
as learned pairings of form and meaning, and many sign forms also remain open to 
iconic interpretation and reanalysis. Of course, conventional signs can serve as the input 
for productive derivational morphological processes as well. As a result, the motivating 
factors of language internal systematicity (morphology) and of analyzable visual imagery 
(iconicity) are inextricably interlinked as aspects of lexical motivation in ASL. 

Another relevant example, a somewhat uncommon sign which we refer to here as 
HASH-THINGS-OUT, is ultimately a reduced derivative of the ASL verb DEBATE. As we 
will show, the sign DEBATE is both iconically and derivationally related to a number 
of other ASL signs that conventionally connote ‘argumentation’, including ARGUE, OP- 
POSE, STRUGGLE, DISCUSS, and DISCUSS-IN-DEPTH. These signs are all morphologically 
related in ASL, though their corresponding English translations are not. Rather than 
getting bogged down in a discussion of the nuances of meaning between the English 
meta-language glosses, we will focus primarily on the relationship between form and 
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meaning among these morphologically-related iconic signs. We begin with the sign AR- 
GUE, which is formed with the index fingers of both hands pointing toward one another 
and simultaneously moving up and down several times (Figure 6). 


(beginning of sign) (end of sign) 


Figure 6: The ASL sign ARGUE. 


The iconic motivation for the sign ARGUE is the visual image of two people engaged 
in heated conversation, with each hand representing a participant in the argument, and 
with the orientation of the two hands towards one another representing that each par- 
ticipant's communicative efforts are directed toward the other (see Lepic et al. 2016 re- 
garding use of the two hands to represent paired referents in lexical signs). This sign's 
form also seems be motivated by the rhythmic properties of the beat gestures that often 
accompany continuous speech, and by the form of the finger-shaking gesture that often 
accompanies “scolding” or “telling somebody off? The association between form and 
meaning in the conventional sign ARGUE can thus be represented as in Table 5. 


Table 5: Aspects of the iconic mapping for ARGUE. 


Form Meaning 

dominant hand one side of an argument 

non-dominant hand the other side of an argument 

orientation of hands toward each other two sides communicating with each other 
index finger handshape the direction of attention 

coordinated movement of the hands communicative interaction between sides 
repeated movement an on-going process 


The signs oppose (Figure 7) and sTRUGGLE (Figure 8) are formed similarly to the sign 
ARGUE, with two index fingers pointed toward one another, however the movement 
patterns for these signs are different. While ARGUE is articulated with repeated up-and- 
down movements, OPPOSE is signed with the hands pulling away from one another in 
a single motion, and srRUGGLE is signed with both hands repeatedly moving back-and- 
forth together along the imagined line they form. 
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$4 


(beginning of sign) (end of sign) 


Figure 7: The ASL sign OPPOSE. 


$4 


(beginning of sign) (end of sign) 


Figure 8: The ASL sign STRUGGLE. 


In the sign oppose, the movement of the hands away from one another can be analyzed 
as motivated by an image of two participants in an argument giving up and retreating 
from one another. In the sign sTRUGGLE, the movement of the hands together can be 
analyzed as motivated by an image of two opposing forces retreating and advancing 
together in turn. These associations between form and meaning can be represented as 
in Tables 6 and 7, respectively. Note that the first several aspects of the iconic mapping, 
such as the use and relative orientation of the two hands, are shared between the signs 
ARGUE, OPPOSE, and STRUGGLE: the aspects that differ between these signs are again 
marked in bold. Here, again, the benefit of the iconic mapping notation is that it makes 
recurring configurations of form and meaning explicit among related and conventional 
iconic signs. 

Turning now to the related signs DISCUSS, DISCUSS-IN-DEPTH, and DEBATE, we see that 
these signs similarly use the index finger of the dominant hand to represent one side of 
an argument, however, in each of these signs, the “opposing side” is represented quite 
differently. In the sign Discuss, the “other side" is actually not represented at all: This 
sign is conventionally formed with the index finger of the dominant hand repeatedly 
striking the flat palm of the non-dominant hand (Figure 9). The form of the sign DISCUSS 
is also partially motivated by the visual image of a list of written topics under discus- 
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Table 6: Aspects of the iconic mapping for OPPOSE. 


Form Meaning 

dominant hand one side of an argument 

non-dominant hand the other side of an argument 

orientation of hands toward each other two sides communicating with each other 
index finger handshape the direction of attention 

movement of hands away from each other  retreating to opposite sides of an argument 
single movement a single event 


Table 7: Aspects of the iconic mapping for STRUGGLE. 


Form Meaning 

dominant hand one side of an argument 

non-dominant hand the other side of an argument 

orientation of hands toward each other two sides communicating with each other 
index finger handshape the direction of attention 

movement along the same plane advancing and falling back in an argument 
repeated movement an on-going process 


sion; in this sign, the non-dominant hand represents the message itself, serving as the 
primary target of communicative effort and as the place of articulation for the dominant 
hand. Note that the flat palm of the non-dominant hand similarly represents a surface 
for written material in signs like joT-DOWN (Figure 10), LEARN (first two segments of Fig- 
ure 3a above), and wnrrE. We do not provide an in-depth analysis of these “written-upon 
surface" signs here, but see Frishberg & Gough (1973: 118) and Aronoff et al. (2003: 75) 
for additional discussion. The association of form and meaning in the sign Discuss can 
be represented as in Table 8, which again exhibits several aspects of structure that have 
been seen already in the iconic mappings for ARGUE, OPPOSE, and STRUGGLE. 


(beginning of sign) (end of sign) 


Figure 9: The ASL sign Discuss. 
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(beginning of sign) (end of sign) 


Figure 10: The ASL sign JOT-DOWN. 


Table 8: Aspects of the iconic mapping for Discuss. 


Form Meaning 

dominant hand one side of an argument 
index finger handshape the direction of attention 
non-dominant hand topics under discussion 
flat palm handshape a written surface 
repeated movement an on-going process 


The sign DISCUSS-IN-DEPTH is in turn formed similarly to the sign piscuss, with the 
index finger contacting the flat palm of the non-dominant hand. However, rather than 
remaining in a single, fixed location, the hands move together between two locations, 
signed at first in front of the signer’s body, and then away from the body to represent a 
second interlocutor (Figure 11). The mapping for this sign is represented in Table 9. Like 
the signs OPPOSE and STRUGGLE, this movement between two locations represents the 
contributions of two participants to the discussion. However, unlike the sign OPPOSE, 
here there is not an implicit contrast between "sides of an argument" Instead, the addi- 
tion of another's perspective to the discussion is collaborative, and the discussion takes 
on greater depth as a result. 

When we move to consider the related sign DEBATE, we again find opposition between 
two sides, which are mapped onto each of the two hands. The sign DEBATE is formed sim- 
ilarly to the signs DISCUSS and DISCUSS-IN-DEPTH, with the index finger of the dominant 
hand repeatedly striking the flat palm of the non-dominant hand. However, DEBATE also 
exhibits what is known as “dominance reversal” (Frishberg 1985; Padden & Perlmutter 
1987): in the formation of this sign, the index finger of the dominant hand first strikes 
the non-dominant hand, then the hands switch roles and configurations, and the index 
finger of the non-dominant hand strikes the flat palm of the dominant hand, with this 
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(beginning of sign) (end of sign) 
Figure 11: The ASL sign DISCUSS-IN-DEPTH. 


Table 9: Aspects of the iconic mapping for DISCUSS-IN-DEPTH. 


Form Meaning 

dominant hand one side of an argument 
index finger handshape the direction of attention 
non-dominant hand topics under discussion 
flat palm handshape a written surface 


movement along the same plane two sides communicating with each other 
repeated movement an on-going process 


reversal being articulated several times in succession (Figure 12)? The iconic image mo- 
tivating the form of the sign DEBATE, then, is that one side discusses its case, then the 
other side discusses its own case, and these discussions continue back and forth. The 
iconic mapping for this sign is given in Table 10. 

Coming finally to the sign HAsH-THINGS-OUT, this sign's form is quite similar to the 
sign DEBATE, the two signs differing primarily in that HAsH-THINGS-OUT has a faster 
and smaller series of movements. The form of the sign HASH-THINGS-OUT has undergone 
some restructuring that partially obscures the iconic role of the flat palm as representing 
a written surface, and of the alternation between two distinct points of view. Similarly, 
the sign's meaning is “softened,” still denoting a discussion or negotiation, but with less 
emphasis on the the number and alignment of the participants in the discussion. The 
sign HASH-THINGS-OUT is articulated with the dominant index finger of one hand briefly 
striking the flat palm of the other hand, with this motion alternating between hands 
multiple times in quick succession (Figure 13). This sign is not as amenable to analysis 
in terms of its iconic structure as the preceding signs in this section, as it has undergone 


5 The direction of the movement in the sign DEBATE is also changed; the hands move right-and-left instead of 
forward-and-back as in the previous examples. We suspect that this is because a side-to-side movement is 
easier to articulate while also reversing the dominance ofthe hands, though this changed direction of move- 
ment may well have a semantic motivation (or lend itself to reanalysis based on a semantic motivation), as 
well. 
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(beginning of sign) (end of sign) 
Figure 12: The ASL sign DEBATE. 


Table 10: Aspects of the iconic mapping for DEBATE. 


Form Meaning 

dominant hand one side of an argument 

index finger handshape the direction of attention 

non-dominant hand topics under discussion 

flat palm handshape a written surface 

reversal of dominance yielding the floor to another perspective 
repeated movement an on-going process 


some degree of phonetic erosion: though we can identify, through comparison to the 
related signs DEBATE and Discuss, that the contacting motion between the dominant 
and non-dominant hands is not an arbitrary coincidence, the simplest account for this 
sign is that it is a phonetically reduced and semantically idiosyncratic derivative of the 
sign DEBATE. The sign HASH-THINGS-OUT has drifted both in meaning and in form from 
the sign DEBATE, yielding a new, related sign. 


(beginning of sign) (end of sign) 


Figure 13: The ASL sign HASH-THINGS-OUT. 
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The point of this extended discussion is to demonstrate that there is no clear delin- 
eation between morphology and iconicity in these examples. As with morphological 
(re)analysis, the assessment of an iconic motivation necessarily follows from the pri- 
mary association of a potentially complex form with a potentially complex meaning in 
a holistic sign. Each of the signs discussed in this section can be described both in terms 
of the relationship between the sign’s form and its motivating visual image, and of the 
sign’s conventional form and meaning relative to other conventional ASL signs. Similar 
to the discussion of the sign VERY-sLOw, here we see that aspects of iconic representa- 
tion can also become systematic across groups of signs, and codified as a morphological 
pattern. 

Importantly, an a-morphous theory of iconicity recognizes that lexical signs are the 
primary unit of morphological organization, and an analysis of the relationship between 
meaning and form necessarily proceeds from there. Aspects of form such as the flat 
hand or the extended index finger may come to be associated with aspects of meaning 
by virtue of their systematic re-use across formationally and semantically related signs. 
However, as in spoken language morphology, it is the identifiable parts that may gain 
their meanings by association with their complex wholes, rather than the other way 
around. 


4 Iconicity in word formation 


In this section, we discuss two additional patterns that are iconic and systematic in ASL. 
Both patterns relate to the distinction between morphologically-related pairs of nouns 
and verbs. These patterns are referred to in the literature as the “noun-verb pair” pattern 
and the “handling-instrument” pattern, respectively. 

A quite widely-discussed morphological pattern in ASL concerns pairs of related verbs 
and nouns that differ only in their conventional movement pattern. Following Supalla 
and Newport’s (1978: 100-102) original formulation, these verbs and nouns are related 
pairs of signs such that “the verb expresses the activity performed with or on the object 
named by the noun.” Because they associate verbs and nouns through a non-concate- 
native phonological operation, these pairs of signs have been compared to verb-noun 
pairs in English that differ in terms of syllabic stress (recórd/récord) or vowel quality 
(bleed/ blood), for example. The classic examples are the ASL verb srr and the noun CHAIR: 
both signs are formed with the index and middle fingers extended and held together 
on each hand (a configuration typically referred to as the “U handshape"), and in the 
articulation of both signs, the hands have the same orientation and overall movement, 
with the dominant hand moving to contact the top of the non-dominant hand. However, 
the movement pattern differs between the two signs: srr is signed with the dominant 
hand moving to rest on the top of the non-dominant hand (Figure 14a), while CHAIR is 
signed with a shorter, repeated movement (Figure 14b). 

Across noun-verb pairs, nouns are signed with repeated, restrained movements, while 
verbs are signed with longer, continuous movements: Supalla and Newport identify a 
number of sub-patterns within this broader generalization, and across all pairs that they 
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e 


a. (beginning of sign) (end of sign) 


* ë 


b. (beginning of sign) (end of sign) 


Figure 14: The ASL sign str (a) is articulated with a longer, single movement, 
and CHAIR (b) is articulated with a shorter, repeated movement. 


identify, the nouns are all articulated with a constrained, repeated movement. However, 
several different movement patterns are observed among the verbs. These sub-patterns 
include a single unidirectional movement (as in FLY-AIRPLANE and AIRPLANE); a repeated 
unidirectional movement (as in SWEEP and BROOM); and a repeated bidirectional move- 
ment (as in ERASE-CHALKBOARD and ERASER). 

Relevant for our purposes here is the fact that the forms of these signs remain quite 
iconic in synchronic ASL; not only does the configuration and orientation of the hand 
iconically profile aspects of the referent object, as we discuss below, but, similar to the 
discussion of the sign vERY-sLow in 82, the contrasting movement patterns themselves 
have an underlying iconic motivation (and see Wilcox 2004 for additional discussion of 
this point). Regarding the multiple verbal movement sub-patterns found among noun- 
verb pairs, Supalla and Newport note that "in general, single movement in the sign corre- 
sponds to single, punctual or perfective action. Repeated movement, in contrast, refers to 
durative or iterative activity which is made of punctual actions" (Supalla & Newport 1978: 
103-104). This description suggests that the perception of iconicity has not diminished 
from these signs. These verbs can be analyzed as transparently representing motion with 
motion, with, for example, the single phonological motion of the sign srr motivated by 
the motion of a human body settling on a flat surface. 

The movement pattern for nouns can similarly be analyzed as motivated by meaning 
(see also Wilcox 2004: 131-132). In ASL, repeated forms can represent repeated actions 
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or, more abstractly, a general activity or the instrument canonically associated with an 
action. A derivational process discussed by Padden & Perlmutter (1987: 343), for example, 
is the “activity noun” rule, in which small repeated movements derive the noun ACTING 
from the verb Act, and the noun SWIMMING from the verb swim. This is also consis- 
tent with the cross-linguistic use of reduplicated forms to derive nouns from verbs (e.g., 
Nivens 1993, Kouwenberg & LaCharité 2001, Adelaar & Himmelmann 2005): Kouwen- 
berg & LaCharité (2015: 984), for example, provide kriep-kriep ‘scrapings’ as a noun 
derived through reduplication of the verb kriep ‘to scrape’ in Jamacian, and doro-doro 
‘sieve’ as a noun derived through reduplication of the verb doro ‘to sift’ in Sranan. In 
noun-verb pairs in ASL, as well, the repetition of the verb’s phonological movement is 
used to denote the instrument associated with the action by de-emphasizing the action 
inherent to the verb. 

To make the relationship between related verbs and nouns concrete, in Tables 11 and 
12 we provide the iconic mapping for the signs sir and CHAIR, respectively. These iconic 
mappings are identical except for their phonological movements and the corresponding 
aspects of meaning. These differences in movement mark this pair of signs as participat- 
ing in the “noun-verb” pattern in ASL. 


Table 11: Aspects of the iconic mapping for srr. 


Form Meaning 

non-dominant hand a surface to be sat on 

dominant hand an object in motion 

U-handshape paired human legs 

contacting movement human figure settles on the surface 


single, continuous movement a single, perfective action 


Table 12: Aspects of the iconic mapping for CHAIR. 


Form Meaning 

non-dominant hand a surface to be sat on 

dominant hand an object in motion 

U-handshape paired human legs 

contacting movement human figure settles on the surface 


repeated, constrained movement an object that is acted on 


In our recent work, we have discussed another pattern that distinguishes a subset of 
related verbs and nouns in ASL: the "handling and instrument" pattern (Padden et al. 
2015). Unlike the noun-verb pairs described above, which are distinguished from one 
another based on properties of their movement, handling and instrument signs are dis- 
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tinguished from one another based primarily on their phonological handshapes. As an 
example, in ASL, the concept ‘toothbrush’ can be represented by either of two related 
forms, both of which involve a constrained, repeated movement near the mouth: the 
“handling” form has the hand configured in a variant of the fist handshape, shaped as 
though grasping an imagined toothbrush. The corresponding “instrument” form addi- 
tionally has the index finger extended, representing the shape of the toothbrush, itself 
(see Padden et al. 2015: 82). 

Another pair of signs fitting this pattern are two variant forms for the concept ‘nail 
polish’: Both ‘nail polish’ forms are articulated with the fingers of the dominant hand 
repeatedly brushing the fingers of the non-dominant hand. The handling form is signed 
with the dominant hand in what is known as the “F handshape", with the index finger 
contacting the thumb as though grasping a small, thin brush (Figure 15a), and the instru- 
ment form is signed with the “U handshape", with index and middle finger extended and 
closed, representing the bristles of a small brush (Figure 15b). 


1 


a. (beginning of sign) (end of sign) 


CR 


b. (beginning of sign) (end of sign) 


Figure 15: Two signs meaning ‘nail polish’. The handling form (a) is signed 
with a “grasping” handshape, while the instrument form (b) is signed with a 


“brushing” handshape. 


In ASL, handling and instrument forms can both also function as verbs, for example 
‘to brush one’s teeth’ or ‘to paint one’s nails’, when signed with the appropriate longer 
movement pattern. However, analyzing elicited sentences in a vignette description task 
(Padden et al. 2015), we found previously that ASL signers are more likely to employ 
handling and instrument forms as verbs and nouns, respectively. Unlike the movement 
contrast that distinguishes noun-verb pairs, the association of handling forms with verbs 
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and instrument forms with nouns is a statistically reliable preference, rather than a cate- 
gorical rule. As an example of this preferred pattern, consider the sentence in (1). In this 
signed sentence, the signer used an instrument form to name the object, and a handling 
form to name the action associated with that object: 


(1) NAIL-POLISH, WOMAN PAINT-NAILS 
‘A woman paints her nails with nail polish’ 


In this example, the topicalized sign NAIL-POLISH is identifiable as a noun because 
of the semantics of the sentence, as well as its participation in the movement-based 
noun-verb pattern described above: This sentence was uttered as a description of a short 
vignette in which a woman painted her nails, and in this sentence, the sign NAIL-POLISH 
is articulated with the short, restrained movement that is characteristic of derived nouns. 
In contrast, the sign PAINT-NAILS is identifiable as a verb: it is articulated with several 
longer, unidirectional movements to represent ‘brushing’ as an on-going process. In this 
sentence, the handshapes for the noun NAIL-POLISH and the verb PAINT-NAILS also differ: 
NAIL-POLISH is formed with the index and middle finger together representing the brush 
used to apply nail polish, while PAmNT-NAILS is formed with the fingers configured as 
though handling the brush as a small object. These aspects of form and meaning for the 
signs NAIL-POLISH and PAINT-NAILS can be represented as in Tables 13 and 14, below. 


Table 13: Aspects of the iconic mapping for NAIL-POLISH. 


Form Meaning 

dominant hand the hand of an agent 
U-handshape the bristles of a small brush 
non-dominant hand a human hand 


repeated, constrained movement an object that is manipulated 


Table 14: Aspects of the iconic mapping for PAINT-NAILS. 


Form Meaning 

dominant hand the hand of an agent 
F-handshape grasping a small object 
non-dominant hand a human hand 


repeated, unidirectional movement the repeated action of a human agent 


The preferential pairing of instrument forms with nouns and handling forms with 
verbs can be analyzed as motivated by the fact that in a handling form, the phonological 
structure profiles the action performed by a human agent (see also Brentari et al. 2012 
and Hwang et al. 2016). The phonological structure of the instrument form additionally 
profiles the shape of the object used to perform the action. 
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In these “noun-verb pair” and “handling and instrument” examples, it is possible to 
associate an aspect of form (such as a handshape or movement pattern) with a syntactic 
category (such as noun or verb) and/or an aspect of meaning (such as agency or dura- 
tion). But these consistent form-meaning pairings are identifiable only in comparison to 
other signs: There is no recurring “handling affix” or “derived noun affix” to mark these 
patterns. Instead, both noun-verb pairs and handling-instrument signs are distinctive 
patterns recognizable only through their iconicity. They are paradigmatic relationships 
that are identifiable on the basis of their iconic motivations. 


5 Conclusion 


In this chapter, we have taken inspiration from Anderson’s a-morphous theory of morph- 
ology, which views Saussurean signs as holistic pairings of potentially complex form and 
potentially complex meaning. From this perspective, rather than complex words deriv- 
ing their meanings from the meanings of their parts, it is instead the parts of a complex 
word that may derive their meanings from the whole words that they appear in. We have 
demonstrated that this “a-morphous” view provides a fresh perspective on sign-internal 
motivation, regardless of whether this motivation can be considered morphological or 
iconic. In sign languages, the perception of iconicity or morphological complexity arises 
from speaker (re)analysis of the relation between form and meaning among the words 
of a language. Any whole word may drift in form and meaning in a way that obscures 
its original, analyzable internal structure. However, signs also often receive analogical 
support from other, related signs. As the field of sign language linguistics continues 
to recognize the relationship between iconicity and morphology as related aspects of 
motivation in linguistic structure, Anderson's insights will continue to provide a useful 
framework for analysis for some time to come. 
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Stephen Anderson warns that approaches to linguistic universals that derive facts about 
language from the structure of the Language faculty or from external forces shaping the 
Primary Linguistic Data— where these two sources must be taken as mutually exclusive 
- not only are difficult to support since the two modes of explanation are entangled, but 
wrong (Anderson 2008). External forces shaping the Primary Linguistic Data can result in 
grammatical regularities — true properties of language - but, in order to recognize them, 
we must "take into account the filtering role of the perceptual systems through which these 
[brute physical facts] are presented to the mind for interpretation" (Anderson 2016: 13). 
In the communication systems of other species we find that particularities of biology are 
connected to pathways for messages. The same should be true for humans. "Why, in fact, 
might we be tempted to believe otherwise?" (Anderson 2011b: 364). Why, indeed? This paper 
argues for iconicity chains in sign languages, chains consisting of mappings from various 
perceptual systems into a visual realization and then into a semantic sense, allowing insight 
into why unrelated languages might exhibit similar signs for abstract concepts. 


1 Introduction 


Stephen Anderson has always been a brave intellect. He eschews theoretical blinders 
when he faces data. Had he been among those to investigate sign languages at the start, 
perhaps the present revolution in sign language analysis would never have needed to 
occur or would have occurred decades ago. For we are, indeed, in the middle of a revolu- 
tion. Much of the early work in sign language linguistics was dedicated to showing how 
much sign languages are like spoken languages, in order to establish beyond a doubt that 
sign languages are bona fide languages (Vermeerbergen 2006). Having established that, 
present research is paying significant attention to modality-driven phenomena (Meier 
2002; Woll 2003). In particular, the existence and extent of iconicity can now be studied 
outside the closet. 

The number of investigations on iconicity in language is increasing exponentially, 
with international conferences occurring multiple times a year for the past few years. 
Given that iconic signs quickly get conventionalized to the point where they are pro- 
cessed by the brain in the same way as bundles of arbitrary features (Emmorey et al. 2004; 
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Fabisiak & Rutkowski 2011), there may be little cognitive difference between iconic and 
non-iconic signs. However, the recognition of iconicity is critical to understanding how 
it interacts with grammar (Meir et al. 2013). While many avenues of inquiry into iconicity 
are presently being explored, this paper is a call for scholars of sign languages to delve 
more deeply into the possibility of synesthesia as a source of iconicity. As Anderson 
(2011a; 2016) so rightfully alerts us, our perceptions can act as filters in communication 
systems. Here I propose an example: iconicity chains that map from any perception or 
other somatosensory information into a visual realization and from there into a semantic 
sense. 

Sections 2 through 4 are a whirlwind introduction to iconicity in general. They are 
far from exhaustive, though I have attempted to be representative with respect to sign 
languages. References, in particular, are only a sampling, and I apologize to all the fine 
work I do not mention. §5 is an outline of the kinds of questions involving synesthesia 
that I hope to direct more attention to and an introduction to the notion of iconicity 
chain. 


2 Background on iconicity 


The term iconicity as it pertains to language was originally applied to non-arbitrary map- 
pings from form to meaning; at the lexical level, essentially a sign that looked like what 
it meant, or a word that sounded like what it meant was labeled iconic. For example, 
with respect to action, in many (all?) sign languages the sign for ‘eat’ involves moving 
hand to mouth; the movement of the sign imitates an essential act in eating for humans 
(in most situations). The signer embodies the actor in the event. Likewise, with respect 
to concrete objects, the sign for ‘deer’ in many (not all) sign languages involves hands 
on either side of the head with one or more fingers extended (antlers as identification). 
The signer again embodies the referent, here mapping relevant parts of a deer onto the 
human body of the signer. For spoken languages, iconic words include imitations of an- 
imal sounds (moo, peep) and words whose sound evokes the sound of their sense (bell, 
slam). This phenomenon is labelled onomatopoeia, about which a great deal has been 
written. 

Recent work gathers within the iconicity aegis a wider range of structural alignments 
- thus a sign/word can be called iconic if its articulation brings to mind its sense - that 
is, it is motivated (Russo 2004; Perniss, Thompson & Vigliocco 2010), where this exten- 
sion of the concept is often culture-based (Adam et al. 2007). For example, joined arms 
swinging in front of the torso form the sign for ‘baby’ in many languages - bringing to 
mind rocking an infant. Again, we have embodiment - but not of the referent ‘baby’, 
rather of someone doing a typical action regarding a baby. Perhaps the long-recognized 
phonesthemes among the IndoEuropean languages (recognized for hundreds of years, 
in fact; see Drellishak 2006) should be included here. For example, words that start with 
the fricative [s] followed by the voiceless stop [t] often deal with lack of motion, in- 
cluding figurative motion (stay, stand, stupid, stymie, stammer, stuck, stagnant, stutter, 
...), but not always (start, stamp, stuff, stag...), while words that start with the fricative 
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[f] followed by the liquid [1] often deal with quick motion, again including figurative 
motion (fleet, flit, flick flow, fly, flutter, flame, floozy, flip ...), but not always (floor, flat, 
flab, flacid...). This more catholic approach to iconicity allows one to assess its role in 
language development and language processing (Emmorey 2014; Perniss & Vigliocco 
2014). 


3 Iconicity involving the perception of vision only 


3.1 Mappings outside language proper 


Iconic mappings from a visually perceived message to meaning can belong to non-verbal 
communication (Argyle 1975; Knapp, Hall & Horgan 2013) or to language proper. 

With respect to visual communication outside language proper, much gesture that 
accompanies speech (co-verbal gesture) or that occurs independently of speech commu- 
nicates information of a general sort (such as attitude or emotional/intellectual involve- 
ment), or information supportive to the speech material (such as gesturing the shape of 
a vase as one talks about arranging flowers) (McNeill 1992; 2000; Goldin-Meadow 1999; 
Kendon 2004; Ozyürek 2014), or information that promotes discourse coherence (Las- 
carides & Stone 2009). While typological categories for gestures can help us get a sense 
of the complexity involved in studying gesture, they are not discrete (Streeck 2009). And 
there are areas of gesture use that have been examined only briefly with a linguist’s eye, 
but with such insight that they beg for further comparison to other uses, such as gestures 
in conducting orchestras (Boyes Braem & Braem 2000). 

Other methods of visual communication can give precise information, including quan- 
titative, though usually it is very limited in range, such as baseball signals (Komissaroff 
2016), diving communication (Recreational Scuba Training Council 2005, and described 
in Mišković et al. 2016), and gestural systems used in hunting (Hindley 2014). Relevant 
mentions of systematic iconic mappings are found in work on semiotics with regard 
to flag signals, comics, the visual arts (all discussed in Berger 1984), auditory signals in 
military aircraft (Doll & Folds 1986), and others. 

My call to arms in this paper is directed at scholars of language proper. Still, it might 
be relevant also to scholars of two other areas. One is mime, where the new and intrigu- 
ing initial comparisons to sign languages in Sutton-Spence & Boyes Braem (2013) are 
suggestive. The other is gesture, for which there are multiple reasons to suspect that 
the directions of investigation suggested in §5 might be relevant. First, much research 
shows alignments between sign languages and gestural communication (such as Hall, 
Ferreira & Mayberry 2013). Second, home sign has many similarities to sign languages 
(Goldin-Meadow 2005, among many), and homesigning children at first base much of 
their communication on iconic gestures, but tend to modify them over time in much 
the same way that iconic signs change (such as going from two-handed gestures to one- 
handed ones; see Tomaszewski 2006). Third, emergent sign languages tend to quickly 
move from elaborate gestures to simplified signs (Kocab, Pyers & Senghas 2014), but 
iconicity manages to persist (Hwang et al. 2016). Fourth, children go through an in- 
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termediate stage in which they use gesture-word combinations as they transition from 
single-word utterances to more complex phrases (Capirci et al. 1996). And, fifth, there 
are arguments for a gestural origin of all manual linguistic systems (Ortega & Morgan 
2015). 


3.2 Mappings within language proper: Sign languages 


Here we look at sign languages. There are also interesting observations to be considered 
about language represented in text, as noted briefly in the next subsection. 

Meaning-form relationships are apparent across the grammar in sign languages, so 
much so that sign languages do, in fact, have much in common (Woll 2003). The discus- 
sion that follows is generally informed by the foundational work on metaphor of Wilcox 
(2000) and Taub (2001), both of whom suggested analyses that are only recently finding 
confirmation in the experimental work of others. For example, Meir (2010) shows that 
conceptual metaphors involve double mappings, one between source and target domains 
and the other between form and meaning iconicity — and both must “work” in order for 
the entire metaphor to “work”. The pervasiveness of iconicity develops meta-linguistic 
skills that have been argued to be behind the fact that deaf people who use different sign 
languages can establish rich communication with each other much faster than hearing 
people who use different spoken languages (Zeshan 2015). Below I focus on the manuals, 
but the nonmanuals are often iconic, as well (as in Pizzuto et al. 2008). 

In some signs the shape that the manuals assume mimics the shape of the referent or 
some visual form peculiar to the referent (Pizzuto et al. 1995; Pietrandrea & Russo 2007); 
in others the manuals draw outlines of the shape of the referent; in others the size of 
articulation corresponds to the size of the referent. The number of hands in a sign can 
be iconic: signs tend to recruit two hands for senses that encode relationship types (in- 
teraction, location, dimension, and composition) (Lepic et al. 2016). Lexical items cluster 
into families, with an under-specified meaning conveyed by the parts of the signs that 
are in common, which is typically iconic, and specific information added by the variable 
parts of the signs, which might or might not be iconic (Fernald & Napoli 2000). Lepic and 
Padden (this volume) go so far as to say that iconicity is morphology; internal structure 
of signs might be obfuscated by phonological change, but it will still be reinforced by 
the signers’ knowledge of multiple related signs. 

Sign languages vary in many ways on how they encode space (Perniss, Zwitserlood 
& Ozyürek 2015), yet repeatedly the encoding is iconic (Vermeerbergen 2006). Point of 
view is expressed iconically through spatial alignments and relationships (Pyers, Perniss 
& Emmorey 2015). While there is debate over whether spatial loci are logical variables 
(Lillo-Martin & Klima 1990; Neidle et al. 2000) or purely iconic mechanisms that are 
not linguistic at all (Cuxac 1999; Liddell 2003), recent work allows insights from both 
camps in a formal semantics that “makes provisions for iconic requirements at the very 
core of its interpretive procedure” (Schlenker, Lamberton & Santoro 2013: 91, and see 
Giorgolo 2010). Likewise there is debate over whether agreement is morphological or, 
instead, iconic and non-linguistic, since locations in space represent locations in mental 
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space, numbers of extended fingers can indicate numbers of referents, and direction of 
movement can indicate direction of change of transfer (Meier 1987; Janis 1995; Mathur 
2000, vs. Liddell 1995; 2003; and for discussion relevant to this issue based on an atypical 
child signer, see Quinto-Pozos et al. 2013). Again, an appropriate formal semantics can 
make a comprehensive analysis based on the insights of both camps (Schlenker 2016). 

Iconicity plays a role in delivering information about event structure, such as telicity 
(Wilbur 2003; 2008; Strickland et al. 2015; Schlenker 2016), and whether and at what 
rate an event is repeated (Kuhn & Aristodemo 2016). Path shape, speed, and other dy- 
namics of movement can indicate location and dynamics of the action, particularly in 
classifier predicates, where languages can vary on how iconic they are (Aronoff et al. 
2003; Tumtavitikul, Niwatapant & Dill 2009). Temporal ordering of signing corresponds 
to temporal ordering of visualization of the participants and action in events (Napoli 
& Sutton-Spence 2014; Napoli, Sutton-Spence & Quadros Forthcoming). Since there are 
multiple articulators in sign languages (that is, two manuals plus a variety of nonman- 
uals), more than one message can be conveyed at once (Vermeerbergen, Leeson & Cras- 
born 2007; Napoli & Sutton-Spence 2010). Simultaneous articulation of two events that 
occur simultaneously is a further kind of iconicity. Additionally, a signer can embody 
a participant in the event being conveyed, which is an iconicity similar in ways to pan- 
tomime (Metzger & Bahan 2001). 

Meaning-form relationships are exploited across all components of the grammar in 
innovative creative language (Sutton-Spence & Kaneko 2016), as in poetry (Bauman, Nel- 
son & Rose 2006; Sutton-Spence & Napoli 2010; Sutton-Spence & Napoli 2013), humor 
(Sutton-Spence & Napoli 2009), and taboo expressions (Mirus, Fisher & Napoli 2012; 
Napoli, Fisher & Mirus 2013). Much of this iconicity is founded on the cognitive topol- 
ogy involved in mapping the parts of a non-human entity onto the signer’s body, since 
the poet/humorist will typically embody in turn the major characters in order to show 
how each referent’s experience can be revelatory of the human (usually deaf) experience. 
Often attention is directed to the physical realities of the articulators. For example, one 
American Sign Language (ASL) joke concerns a woman on a diet tempted by a cookie. 
The sign TEMPT is made on the non-dominant elbow and the sign COOKIE is made on the 
non-dominant palm. In the joke the dominant hand “runs” from the elbow, along the 
forearm, to the palm of the non-dominant hand -exploiting the physiological connect- 
edness of the two locations to show us the easy path from temptation to sweets. 

Certainly, with respect to sign languages, the judgment of whether a sign is iconic or 
not can be so particular to a culture that it is non-obvious to those outside the culture. 
(Here and throughout this paper I offer speculative remarks on signs from different coun- 
tries. If I do not cite a source, my information comes either from personal knowledge 
or from the website spreadthesign.com.) Iconic handshapes used in handling classifiers 
and object classifiers (otherwise known as entity classifiers), for example, express an 
agentive/non-agentive distinction in many sign languages; if the event is agentive, the 
handling classifier is used, and if it is non-agentive, the object classifier is used, where a 
recent comparison of the sign languages of Italy and America shows that cultural factors 
contribute to the conventionalization of which type of handshape will be used to convey 
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a given event (Brentari et al. 2015). Iconicity involving temporal succession and cyclic- 
ity reflects cultural conceptions of time and thus presents similarities and distinctions 
across sign languages (Kosecki 2014). 

A single example can help seal the point about culture. The sign for ‘rent’ in the sign 
language of Portugal (ALUGAR) is related to the sign for ‘pay’ (PAGAR) with an aspec- 
tual marker for repetition. Both are iconic. In the sign PAGAR the dominant hand taps 
the palm of the nondominant hand. This brings to mind putting payment in someone’s 
waiting palm. In the sign ALUGAR the same handshape on the dominant hand makes a re- 
peated circle going toward the addressee, bringing to mind paying repeatedly. Now let’s 
compare to the sign RENT in America; it is morphologically related to the sign MONTH, 
neither of which at first looks iconic. In the sign MONTH a 1-handshape on the dominant 
hands move down the back of a 1-handshape on the nondominant hand. Only if you 
know that we read down the calendar’s representation of the months in America and 
Canada (where this sign language is used) do you see the iconicity. The sign RENT is the 
sign MONTH with reduplication, so that the dominant hand circles back to repeat that 
downward motion. Only if you know the further fact that rent is generally paid on a 
monthly basis do you see the iconicity. 

An additional complication to recognizing iconicity is that one language’s sign may 
focus on certain visuals of the meaning while another’s may focus on different visuals. 
Again, those visuals might be culture-based or not. The sign for ‘dance’, for example, 
can focus on movement of the torso (in the sign languages of Italy and Turkey, among 
others), of the legs (in the sign languages of America and Japan, among others), of the 
arms (in the sign languages of Germany and India, among others), of the whole person in 
relation to another person (in the sign languages of Austria and Estonia, among others), 
or maybe just on general movement (in the sign language of Iceland). Without knowing 
what the sign means ahead of time, one may be at a loss to guess its meaning just from 
seeing it, but once the meaning is given, one might quickly recognize its iconicity. 


3.3 Mappings within language proper: Print 


Written/print representations of spoken languages can use visual information iconically, 
as in concrete poetry, popular in Greek Alexandria during the Third and Second Cen- 
turies BCE and intermittently up to modern times (Newell 1976). But there are other, 
less obvious ways that print/writing can be iconic. Verbal constructions used in writ- 
ing affect readers’ ability to understand discussions of spatial relationships, the account 
being that the order in which verbal material is presented on the page can help or hin- 
der as one tries to mentally construct spatial models (Moeser 1976; Morris & Bransford 
1982; Ehrlich & Johnson-Laird 1982; Louwerse 2008; Zwaan & Yaxley 2003). In a read- 
ing test that involved static spatial relations, Zwaan (1993: 119) found, “If a text presents 
spatial information in a scattered way, spatial representations are relatively weak, even 
for subjects who are instructed to form spatial representations.” When people read, they 
make a mental representation of the orientation (Stanfield & Zwaan 2001) and shape 
(Zwaan, Stanfield & Yaxley 2002) of objects. Readers “mentally simulate the visibility 
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of objects during language comprehension” (Yaxley & Zwaan 2007: 229). Additionally, 
in understanding referents in reading, it appears that, if too much verbal material inter- 
venes between two mentions of a referent, interpretation is hindered; again it looks like 
mentioning a referent foregrounds it in one’s mental representation of the text (Sanford 
& Garrod 1981). 


4 Iconicity involving the perception of sound only 


Iconic mappings from an aurally perceived message to meaning are used in spoken lan- 
guages, of course, but they are also used in Morse code (a code based on written language) 
and sound signals (whistles in baseball, sirens, melodies). Studies of iconicity in spoken 
language are experiencing a renaissance, just as studies of iconicity in sign language are 
(Perniss, Thompson & Vigliocco 2010). There is a growing consensus (Vigliocco, Perniss 
& Vinson 2014; Goldin-Meadow & Brentari 2015) that with respect to iconicity in order 
to really understand the extent of it in spoken language we should be considering speech 
plus gestures, not just speech. In this section, however, I discuss only mappings from 
auditory form. 

In here falls onomatopoeia, mentioned in §2. Spoken languages can also play with 
intensity, duration, and pitch in iconic ways (say angry in a loud, angry voice; say slow 
with a drawn out syllable nucleus; say little girl with a very high pitch). But spoken lan- 
guages can move beyond that to sound articulations that bring to mind a meaning - that 
is, associative iconicity (in the sense of Fischer 1999) — just as sign languages do (as in the 
discussion of signs meaning ‘baby’ in 82). In this regard, sometimes particular features of 
sounds are associated with meaning in a relatively stable way across several languages 
(Sapir 1929; Taylor 1963; Werner & Wapner 1952; Wertheimer 1958, among early studies, 
and Hinton, Nichols & Ohala 1994; Voeltz & Kilian-Hatz 2001, among more recent stud- 
ies). For example, in many languages high pitch is associated with small size of referent, 
whether the referent be entity or action (Jespersen 1922; Nuckolls 1999). Perhaps a high 
pitch brings to mind the voices of smaller people (Evans, Neave & Wakelin 2006), and 
so the size association spreads from people to any referent. For English, some claim a 
back rounded vowel is gloomy while a front low vowel is brash - compare English drip 
(a relatively high pitch vowel) to drop and both to droop (a back rounded vowel); and 
slip (a relatively high pitch vowel) to slap (a front low vowel). Correspondences can be 
so strong that manufacturers capitalize on them when naming products (Spence 2012). 
Additionally, it's been shown that prosody works together with segmental information 
as cues for iconic interpetations of words (Dingemanse et al. 2016). 

Just as in spoken languages, iconicity can be felt beyond the lexicon, where the dis- 
cussion here is generally informed by Fabisiak & Rutkowski (2011). For example, morph- 
ology can be iconic: reduplication (Moravcsik 1978) can indicate plurality (Macdonald 
1976) or intensification (Murane 1974). We can witness iconicity even in syntax. Moul- 
ton & Robinson (1981) show how a radio announcer can order and pace words to reflect 
the order of participants in an action and the timing of that action. The order of tem- 
poral subordinate clauses within the next adjacent clause up often reflects the temporal 
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relationship between that clause and the action of the adjacent clause (Haiman 1985; 
Kortmann 1991; Diessel 2005). 

Conventional judgments of iconicity in spoken languages are affected by cultural fac- 
tors, just as they are in sign languages. That’s obvious for things like phonesthemes, 
where a given speaker is making associations across multiple lexical items in a language. 
But it also occurs in onomatopoeia. It’s instructive to peruse Derek Abbott’s (2004) ani- 
mal sound website in this regard. Granted, as Abbott points out, “a Swedish Vallhund is 
not an Anatolian Shepherd or a Japanese Spitz. But variations in dog breeds can’t fully 
account for these differences...” (Friedman 2015). Sound iconicity runs the gamut from 
realistic to simply bizarre: words for animal sounds might actually ring true to a farmer 
(kpok, kpok kpok kpok) or other person with direct experience, while others don’t ring 
true to anyone (cockadoodledoo). Often the same animal’s sound is rendered distinctly 
differently in different languages. People simply somehow agree to accept a given sound 
in speech as the conventional rendering of the animal (or other type of) sound. Perhaps 
digging into the culture will allow a better understanding (as often happens in linguistic 
study, see Duranti 2009). 

A huge gap in our discussion thus far is so-called mimetics. We now use them as a 
jumping off point to other types of iconicity in language in §5. 


5 Cross-modal iconicity 


5.1 Mimetics and iconicity chains 


Mapping from a visual entity to a meaning that involves vision or from an auditory 
entity to a meaning that involves sound is relatively straightforward. But this is not 
the only kind of iconic mapping found in language. As we saw for both sign languages 
and spoken languages, associative (or motivated) iconicity can occur. At times these 
associations seem to belong fairly generally to the human experience, such as the sight 
of rocking layered arms meaning ‘baby’ and a high-pitched vowel adding small size to 
the meaning of a word. But, mostly, linguists seem to have assumed a cultural basis 
for these mappings, often pointing out how the mappings are particular to a specific 
language. 

However, there are important challenges to that assumption. In some spoken lan- 
guages recognizable patterns of sounds indicate a non-arbitrary relationship between 
form and sense - where the words with this property are labelled mimetic and the phe- 
nomenon is called sound symbolism. Korean, for example, exhibits correspondences 
between sound and subjective impressions or other modalities (smell, taste, vision) as 
well as “size, mood, movement, shape, and other perceptual and psychological experi- 
ences" (Cho 2006: 64), where changes in vowels and consonants systematically relate to 
meaning differences. Japanese, instead, uses templates (fixed patterns of consonants and 
vowels) to indicate mimetics (Hamano 1998) For example, the correspondence between 
an action (lick, roll) or a property (drunk, exhausted) and the sound of the word refer- 
ring to that action/property feels non-arbitrary to Japanese speakers. Correspondences 
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as precise as between a sound and an emotional reaction to a taste are felt to be firm (Kag- 
itani et al. 2014). The surprising (perhaps at first astonishing) part is that adult speakers 
of English and Japanese as well as Japanese toddlers are sensitive to mimetic correspon- 
dences in novel verbs (Imai et al. 2008). Thus there is evidence of something going on 
that is not grounded fully in culture - something that allows cross-modal iconicity. 

That something may be simply the physical nature of language - something that Ste- 
phen Anderson insists we acknowledge. Language is expressed through the body and 
governed by the brain - both complex physical entities. It's possible to step back for a 
wider perspective on iconicity, a biological perspective, which has the potential to of- 
fer a more comprehensive understanding. The contribution of biology to sign language 
iconicity has been recognized before: Woll (2009: 150) points out that since "the visual 
medium affords the identification of objects and their spatial locations as a function of 
their forms and locations on the retina and sensory cortex, it is not surprising that cor- 
tical systems specialised for such mappings are utilised when sign languages capture 
these relationships" Here I widen the perspective further based on recent findings in 
cognition that there is multisensory integration in the midbrain and cerebral cortex of 
mammals (Stein & Stanford 2008, among many). 

It is important to recognize at the outset that abstract entities can have physiologi- 
cal effects on the human body. Our own emotions, for example, trigger bodily sensa- 
tions through the somatosensory systems, including activation of cardiovascular, skele- 
tomuscular, neuroendocrine, and autonomic nervous systems (Nummenmaa et al. 2014). 
Additionally, visual recognition of another's emotions (through observation of their fa- 
cial expressions and body postures) can trigger somatosensory reactions (Rudrauf et al. 
2009; Sel, Forster & Calvo-Merino 2014). This is why, all other things being equal, we 
smile when someone smiles at us (Niedenthal 2007). Additionally, emotions have a wide 
range of effects on our behavior and psyche, including our moral judgments (Charland 
& Zachar 2008), which might offer another way to recognize emotions in others. 

For our purposes, the entities of the world can be organized into three groups: 


those that have a realization apparent in any way (to any part ofthe somatosensory 
system - such as wind or respect) 


those that have a realization apparent to sight (you could draw them, for example 
— such as cars) 


those that have a realization apparent to hearing (you could record them, for ex- 
ample - such as rain) 


These three groups are not discrete; they relate as in the schema in Figure 1. 

All iconic mappings between distinct entities rely on metaphor, metaphony, and/or 
analogy, all filtered through experience, so all of them are complex. However, some 
of the mappings are more complex than others. With respect to Korean and Japanese 
mimetics, for example, we can find mappings between a sound on the one hand and 
a taste or a mood on the other (Garrigues 1995; Iwasaki et al. 2013). That is, we have 
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Figure 1: Entities of the world organized by type of realization 


a mapping between something in the area C and something in the area A-C. So the 
mapping is cross-modal. 

Iconic mappings in speech or sign into abstract meanings (which have neither visual 
nor auditory realizations) are necessarily cross-modal since they are mapping between 
elements in B or C and elements in A - (B u C). Metaphor is the key mechanism used in 
sign languages for this kind of conceptual mapping (Borghi et al. 2014). Many conceptual 
metaphors hold both in spoken and sign languages (Roush 2016). 

Metaphoric mappings that cross perceptual modalities are labeled synesthesia (Cy- 
towic 1989, among many). We find mappings between many different senses, where a 
given sense S; can elicit a different sense S, and vice-versa. For example, texture can 
elicit visual forms (Simner & Ludwig 2012; Ludwig & Simner 2013) and visual forms can 
elicit textures (Albertazzi et al. 2016). 

Many instances of synesthesia map a variety of senses onto auditory forms. Thus 
the odors of perfumery are mapped onto pitches (Belkin et al. 1997) and food smells are 
mapped onto the sounds of musical instruments (Crisinel & Spence 2012). Likewise, we 
find mappings from a variety of senses onto visual forms. With regard to odor again, 
Dematté, Sanabria & Spence (2006) examine odors mapped onto color hue, and Kemp & 
Gilbert (1997) show that intense smells are associated with darker colors. Hanson-Vaux, 
Crisinel & Spence (2013) look at the mappings from odors onto visual shape entities, 
where intense, unpleasant odors elicit angular shapes and other odors elicit rounded 
shapes. 

Importantly, both hearing and deaf people can smile when a child says, “This tastes 
green? In fact, congenitally blind people and congenitally deaf people are sensitive to 
synesthesia, and in the same ways as sighted, hearing people (Ittyerah & Mitra 1988). 
A recent study shows that color-shape associations, in particular, are consistent across 
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deaf and hearing people (Chen et al. 2014). Further, sign recognition involves reference 
to sensory-motor properties of one’s own body (Corina & Gutierrez 2016), which would 
be compatible with sign formation likewise involving such reference. In sum, there is 
every reason to expect to find the effects of synesthesia in sign languages as much as in 
spoken languages. 

Cross-modal mappings can have multiple layers of complexity. Let’s say that we want 
to convey the sense of ‘happy’ in a sign language. What visual representation might we 
appeal to? A smile might come to mind. And some languages use the smile, though they 
draw it (in a variety of ways) rather than have the signer actually smile (such as the 
sign languages of Italy, Latvia, Portugal, and Turkey). The problem with using an actual 
smile is that that affective facial expression might conflict with other information in the 
message. If we ask in a sign language, “Are you happy?” we might well not smile, but 
use a neutral mouth as other nonmanuals articulate (such as the eyebrows raising). If 
we say in a sign language, “I’m not happy; it might be odd to smile (except in contrived 
circumstances). So the sense ‘happy’ has to be conveyed some other way. I conjecture 
that this way is through a chain of mappings, a chain which necessarily has precisely 
two cross-modal links — one from an entity that has a somatosensory realization other 
than visual to an entity that has a visual realization, and the next from that visual entity 
to a third entity that has a somatosensory realization other than visual (and see the 
distinction between two types of icons in Peirce 1932). An iconicity chain has complexity 
similar to that of the double metaphors of Meir (2010), but iconicity chains are culture 
independent, since they are grounded in biological properties. 

In this regard, consider the sign HAPPY in ASL in Figure 2. (Figure from Lifeprint: 
http://www lifeprint.com/asl101/pages-signs/h/happy.htm) 


Figure 2: HAPPY in ASL 


Here the hands hit the chest then circle away and hit again repeatedly. My conjecture 
is that this sign is built on the fact that physical activity correlates highly with happiness 
(Kahneman, Diener & Schwarz 1999) and with a heartbeat that we are conscious of. So 
it's like our heart is hopping inside our chests. The mapping is a chain from a hopping 
heart (which we cannot see, but we can feel) to an external representation of that heart 
(the visuals in Figure 2) and then to the abstract meaning ‘happy’. So we are going from 
one somatosensory sense to vision and then from vision to a sense that is an emotion. 
Both links of the chain are cross-modal and both are iconic, as shown in Figure 3. 


527 


Donna Jo Napoli 


intemal sensation of excited 
(hopping) heartbeat 


hands hopping on chest to give visual 
representation of that heartbeat 


sense ‘happy’ 


Figure 3: Schema of the iconicity chain for HAPPY in ASL 


This iconicity chain between heart(beat) and happiness has its origins in our biological 
selves; it digs down inside our bodies. As such, it is culture-independent and we might 
expect to find sign languages using a chest-touching sign for ‘happy’ both among those 
that are genetically related to or have had contact with ASL (as happens in the sign lan- 
guages of France and India) and among those that most likely developed independently 
(as happens in the sign languages of Austria, Germany, Iceland, and Sweden). 

If we are alerted to likely possibilities for synesthesia in sign languages - that is, for 
iconicity chains - not only might we find them, but we might realize that signs we 
previously considered arbitrary are not. I contend this not only because of my own 
initial examinations, but because, on a theoretical basis, it should be true. Communica- 
tion systems are subject to two potentially conflicting pressures: transmission efficiency, 
which drives them toward messages that are simple to produce (and perceive), and com- 
prehension efficiency, which drives them toward messages that are semantically clear. 
As Roberts, Lewandowski & Galantucci (2015: 52) show in experimental work, “where 
iconicity was available, it provided scaffolding for the construction of communication 
systems and was overwhelmingly adopted? This goes hand-in-hand with work that 
shows that iconicity facilitates both recognition and production of signs (Vinson et al. 
2015). In other words, comprehension efficiency trumps transmission efficiency. Rightly 
so. Non-arbitrary mappings, if they can be made, should be, since they enhance inter- 
pretability (and see Givón 1989). To do otherwise would be contrary to the overall goal 
of communication (and see Napoli, Sutton-Spence & Quadros Forthcoming). 
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In the rest of this section I briefly mention two areas for research in language synes- 
thesia outside of speech, then focus on the area of potential sources for iconicity chains 
in sign languages. 


5.2 Potential for iconicity chains in braille 


Blind hearing people can access speech easily, thus, generally, language access is high, 
including access to all the mechanisms for synesthesia that speech uses. However, lan- 
guage represented in print is visual, thus inaccessible. The bumps in a braille cell allow 
print to be accessed tactilely and, because of this, a range of possibilities for cross-modal 
iconicity in language arise, where the connection between the two links in the iconicity 
chain is a tactile entity (comparable to the visual entity in sign language iconicity chains). 
So far as I know, such possibilities are not well-explored, though the iconic possibilities 
in rendering musical scores and literature on music theory are many (Pacun 2009; John- 
son 2009). Given that cutaneous perception is high, it’s no surprise that there’s ongoing 
research on creating tactile icons (tactons) that include rhythm, location, frequency, am- 
plitude, and duration of a tactile pulse, where iconicity influences the choices designers 
are making on attaching meaning to the various tactile factors (Brewster & Brown 2004), 
as well as V-Braille, a way to haptically represent Braille characters on a touch screen 
(such as on a mobile phone) (Jayant et al. 2010). 


5.3 Potential for iconicity chains in tactile sign 


The deaf-blind cannot access language visually or aurally. Thus, another area of explo- 
ration regarding synesthesia is tactile language for the deaf-blind, a complex of methods, 
such as fingerspelling, on-body signing, and hand-over-hand signing (Edwards 2012, and 
following). Research has shown that visual and tactile iconicity ratings of signs are simi- 
lar across blind people, deaf people, and hearing-sighted people, allowing indications of 
which signs should be most salient to deaf-blind children (Griffith, Robinson & Panagos 
1983). 


5.4 Potential for iconicity chains in sign languages 


Here I present the main course of this drawn-out feast in the form of a sampling of 
cross-modal associations that could serve as fertile ground for iconicity chains in sign 
languages. The iconic chains that interest me below involve mappings from an entity 
that is not visually apparent onto an entity that is visually apparent and then onto a 
meaning that is not visual in nature. Thus both links in these iconic chains are cross- 
modal. These are conceptually the most complex iconicity chains and, thus, might be 
the hardest to recognize. 

The growing literature on synesthesia includes mappings involving forms that have 
realizations in various aspects of the somatosensory experience, including texture, tem- 
perature, weight, taste, smell, shape, emotions - the sorts of things that many of the 
mimetics in Korean and Japanese are based on. For humans, “meanings” associated with 


529 


Donna Jo Napoli 


such forms, as discussed in the literature cited below, tend to be general and abstract 
rather than particular and concrete (to be contrasted with the “meanings” such forms 
might convey to nonhumans). For example, with respect to associations from a visual 
entity, a color might be associated with a broadly understood emotion (Johnson, Johnson 
& Baksh 1986), and with respect to associations from an auditory entity, pitch and tempo 
in music might be associated with a broadly understood emotion (Hevner 1937; Brower 
2000). Often the emotion is hedonic, since the orbitofrontal cortex, which is involved in 
sensory integration, is also involved in hedonic experience (Kringelbach 2005). Evoking 
such general mappings has been claimed to be effective in psychopathology therapies 
(Lang 1979, and much work since). 

Let’s look at five potential sources for iconic chains in sign languages: mappings from 
texture, temperature, weight, taste, and smell. 


5.4.1 Mappings from texture 


Research on baby rhesus monkeys shows they choose soft models of their mothers over 
wire ones (Harlow 1959), which has been taken to show the importance of touch (particu- 
larly human touch) for human babies (Lynch 1970; Schneider 1996). Further, neonates in 
pain find comfort in skin-to-skin contact — which introduces a complex of factors, to be 
sure, but all transferred via that pliable skin (Kostandy et al. 2008). The potential, then, 
for an iconicity chain taking us from some relevant touch to the visuals related to that 
touch and then to the meaning 'safety' or love' arises. I believe that potential is realized. 
The sign GUVENLI ‘safe’ in the sign language of Turkey uses two claw handshapes (the 
curled-5-handshape), which rake across the chest out to each side, one above the other 
(shown in the left side of Figure 4). This sign at first has no apparent iconicity. However, 
the sign KÜRK ‘fur coat’ is a compound of the sign for ‘fur’ and the sign for ‘coat’. The 
sign for ‘fur’ has two claw handshapes raking out to each side from the center of the 
face (shown in the right side of Figure 4). 


Figure 4: ‘Safe’ (left) and ‘fur’ (right) in the sign language of Turkey 
The signs differ only by the parameter of location: one is on the chest, the other is on 


the face; one places one hand higher, the other has the hands at the same level. I suggest 
an iconicity chain mapping from the texture of softness onto the visual of fur or hair on 
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the chest then onto the feeling of security and coziness that juvenile mammals (including 
humans) experience as they nestle against the chest of mature mammals (including male 
humans; i.e., daddy keeps us safe). 


5.4.2 Mappings from temperature 


Research on human emotional responses to changing a hand temperature show that 
changes of 6 degrees centigrade are distinctly unpleasant (Salminen et al. 2013). Thus 
one might associate warmth with pleasure/health/life, but extreme heat or cold with dis- 
pleasure/fear/death. Consider here the sign TEAMA ‘fear’ in the sign language of Romania. 
The arms are bent and held close to the body, shaking them as if in a shiver (shown in 
the left side of Figure 5 - this is an embodiment sign, so the signer is shivering). Cer- 
tainly, humans shake not only when they experience cold (where shivers occur because 
the drop in temperature signals the hypothalamus to stimulate muscle contractions to 
warm up the body) but when they experience fear (where adrenaline causes shivers) and 
a number of other emotions. And the sign sTRACH ‘fear’ in the sign language of Poland 
is also built on an image of shaking, but this time it’s the legs that shake (shown in the 
right side of Figure 5 - this is a classifier sign, so the dominant hand represents two legs 
that shake on the palm of the nondominant hand). 


Figure 5: ‘Fear’ in the sign languages of Romania (left) and Poland (right) 


What makes me think the shaking is related directly to coldness in the sign in Ro- 
mania is that this sign is identical to the sign for ‘cold’ in many sign languages (such 
as the sign languages of America, Austria, Britain, Czech Republic, Estonia, Germany, 
India, Italy, Japan, Russia, Spain, Turkey - a group that has three families in it, but also 
several languages that are unrelated to all others in that group). So the iconicity chain 
in the sign language of Romania would go from the sensation of coldness to the visually 
recognizable action of shivering to the emotion of fear (and see discussion on spoken 
languages in this regard in Atkins & Levin 1995). 
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5.4.3 Mappings from weight 


There is little work in the literature regarding synesthesia and weight, however some 
work suggests a synesthetic link between weight and color brightness, where the darker 
a color is, the heavier it is perceived to be (Ward, Banissy & Jonas 2008). Any iconic 
chain of this sort in a sign language would have only one link being cross-modal. 
Given my own experience with the world, however, I suspect other iconicity chains 
regarding weight exist where both links are cross-modal. For example, heavy might be 
associated with strong/substantial and light might be associated with weak/insubstantial 
in a broad range of contexts (such as judging food sources) while in other contexts heavy 
might be associated with dangerous/overwhelming and light with innocuous/possible 
(such as in lifting objects). My speculations might find confirmation. The signs for ‘heavy’ 
and 'strong' are close to identical to each other in the sign languages of Austria and 
Germany (which might or might not be independent facts - see Napoli & Sanders in 
preparation), where the visual image is related to lifting something heavy. In Figure 6 
we see the sign for ‘heavy’ in Austria. In Figure 7 we see the sign for ‘strong’ in Austria. 
The handshapes and locations are the same. The orientation of the palms is upward in 
‘heavy’ and the movement is relatively slow. The orientation of the palms is toward the 
signer in 'strong' and the movement is rapid (which is why the screenshot is blurred). 


Figure 6: ‘Heavy’ in the sign language of Austria 


Further, the signs for ‘lightweight’ and ‘possible’ are similar to one another in the sign 
language of India. 


5.4.4 Mappings from taste 


It appears that humans are born with a negative reaction (disgust) to acidic and a positive 
one to sweet (Fox & Davidson 1986). The sweet response is to taste rather than to a 
high-energy source, since neonates respond equally to sucrose as to artificial sweeteners 
(Ramenghi et al. 1996). The sign for ‘sweet’ is made at (or below or beside) the mouth in 
sign language after sign language, as expected (the only exception on spreadthesign.com 
being the sign language of Estonia, but it looks like the signer actually signed the sense 
of ‘candy’, rather than of the taste ‘sweet’). Perhaps the association of sweetness with 
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Figure 7: ‘Strong’ in the sign language of Austria 


goodness is behind the fact that the sign for ‘good’ is made at the mouth in several 
languages (including those of America, Brazil, and France - all members of a single 
family). Of course, the reason here could as easily be that food and eating in general are 
associated with well-being (at least in that language family). 


5.4.5 Mappings from smell 


Some odors elicit pleasure and some displeasure (Alaoui-Ismaili et al. 1997), which might 
mean 'good/come close’ (the smell of a hot bowl of pasta) versus ‘bad/go away’ (the smell 
of a skunk). Recent work suggests that affective experiences induced by odors are quite 
general, but allow for refinement of types, involving “well-being, social interaction, dan- 
ger prevention, arousal or relaxation sensations, and conscious recollection of emotional 
memories” (Chrea et al. 2009: 49). Iconic chains mapping from odor to a visual form to 
an emotion occur. The sign REPUGNAR ‘disgust’ in the sign language of Brazil is clearly 
built on the perception of a bad smell (and is similar to CHTERO ‘odor’), as is the sign 
ASQUEAR 'odor' in the sign language of Spain. In Figure 8 we see the Brazil sign, where 
the hand moves to the nose, then out to the ipsilateral side, then down and across to the 
contralateral side. 


Figure 8: ‘Disgust’ in the sign language of Brazil 
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6 Conclusion 


Understanding iconicity in sign languages requires attention to cross-modal associations, 
which might be complex - involving an iconicity chain that has two mappings, rather 
than one. Thus the present work aims to alert researchers to such associations. 

The benefits of a more (nearly) adequate approach to iconicity in communication are 
several. Understanding iconicity can help separate out what language is responsible for 
from what general communication is responsible for, and it might help us understand 
some of the more thorny areas in sign language studies, including why the signs for 
abstract concepts can be similar in unrelated languages. 

Of particular interest to me is how languages change over time. Sign languages lack 
the regular changes typical of spoken language change (as discussed in any classical 
or more recent introduction to historical linguistics, such as Anttila 1972) but, instead, 
display only tendencies (Moser 1990). Thus diachronic changes motivated by the drive 
for ease of articulation (Napoli, Sanders & Wright 2014; Sanders & Napoli 2016b; Sanders 
& Napoli 2016a) apply erratically. The (near) insistence on maintaining iconicity might 
be at least partially responsible here, especially in light of the fact that iconic words 
in spoken language sometimes do not obey the same constraints that non-iconic words 
obey (as noted in Meir 2010). For example, well-formedness conditions and ordinary 
phonological rules do not always apply to Japanese mimetics (Itó & Mester 1995). And the 
onomatopoetic word peep in English (pipen in Middle English) did not undergo the Great 
Vowel Shift perhaps because of the desire to maintain the [i] which connects to small 
size (Hock 1986: 294). Additionally, in West Papuan languages it looks like onomatopoeia 
in words meaning ‘chicken’ and ‘cat’ may have contributed both to the spread of these 
words among genetically unrelated languages and to their resistance to phonological 
change over time (Gasser in preparation). We find further evidence of onomatopoeia 
being ill-behaved in Italian (Marina Nespor, personal communication, December 2016). 
The horse in Italian goes iiii (that is [iiii], while the verb for this has consonants between 
the vowels: nitrire), and many parents reading to small children might enunciate four 
syllables here. But in the rest of the lexicon there are no words that contain a sequence of 
four uninterrupted vowels, except perhaps in other animal sounds. For example, the goat 
or sheep might go [be], [bee], [beee], or [beeee] (while the verb for this is well-formed: 
belare). Likewise, the mouse goes [skwit skwit], but elsewhere in Italian words do not 
end in [t], nor do syllables unless the [t] is part of a geminate (and, again, the verb for 
this is well-formed: squittire). And in the Turkic language Kazakh, we find onomatopoeic 
words with consonant clusters that fall outside the range of normally attested clusters 
(Washington in preparation). In sum, iconicity allows an alignment between form and 
meaning, and the pressure to establish it when it is possible and the subsequent pressure 
to maintain it once it is established are strong. 
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Since the late 1970s, crosslinguistic variation has generally been handled by means of UG- 
specified parameters. On the positive side, thinking of variation in terms of parameterized 
principles unleashed an unprecedented amount of work in comparative syntax, leading to 
the discovery of heretofore unknown morphosyntactic phenomena and crosslinguistic gen- 
eralizations pertaining to them. On the negative side, however, both macroparameters and 
microparameters have proven themselves to be empirically inadequate and conceptually 
nonminimalist. Alternatives to parameters are grounded approaches, epigenetic approaches, 
and reductionist approaches, the last two of which seem both empirically and conceptually 
quite promising. 


1 Introduction 


The existence of crosslinguistic variation has always been problematic for syntacticians. 
If there is a universal grammar, one might ask, then why aren’t all languages exactly 
the same? In the earliest work in generative syntax, characterizing the space in which 
languages could differ, whether at the surface or at a deep level, was not a priority. At 
the time, surface differences between languages and dialects were generally attributed 
to language-particular rules or filters. 

In the late 1970s, however, a strategy was developed that allowed the simultaneous 
development of a rich theory of Universal Grammar (UG) along with a detailed account 
of the limits of crosslinguistic morphosyntactic variation. In this view, syntactic com- 
plexity results from the interaction of grammatical subsystems, each characterizable in 
terms of its own set of general principles. The central goal of syntactic theory now be- 
came to identify such systems and to characterize the degree to which they might vary 
(be “parameterized”) from language to language. Chomsky (1995) describes succinctly 
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how such variation might be accounted for in what, by the early 1980s, was called the 
“principles-and-parameters” (P&P) approach. 


Within the P&P approach the problems of typology and language variation arise in 
somewhat different form than before. Language differences and typology should 
be reducible to choice of values of parameters. A major research problem is to 
determine just what these options are, and in what components of language they 
are to be found. (Chomsky 1995: 6) 


The first mention of parameters, I believe, was in Chomsky (1976): 


Even if conditions are language- or rule-particular, there are limits to the possible 
diversity of grammar. Thus, such conditions can be regarded as parameters that 
have to be fixed (for the language, or for the particular rules, in the worst case), in 
language learning ... It has often been supposed that conditions on applications of 
rules must be quite general, even universal, to be significant, but that need not be 
the case if establishing a “parameteric” condition permits us to reduce substantially 
the class of possible rules. (Chomsky 1976: 315) 


An interesting question is why Chomsky at this point would propose parameters, 
since there is nothing in his 1976 paper that suggests that they need to be incorporated 
into the theory. A possible answer is that in the same year an MIT dissertation appeared 
(Kim 1976) that showed that Korean obeys a form of the Tensed-S-Condition, even though 
Korean does not distinguish formally between tensed and non-tensed clauses. That fact 
might have planted the seed for the idea of parameterized principles. At around the 
same time, an “external” inspiration for parameters was provided by the work of Jacques 
Monod and Francois Jacob (Monod 1972; “Darwinism reconsidered”). Their idea was 
that slight differences in timing and arrangement of regulatory mechanisms that acti- 
vate genes could result in enormous differences. Berwick & Chomsky (2011: 28) has 
claimed that “Jacob’s model in turn provided part of the inspiration for the Principles 
and Parameters (P&P) approach to language ..”” 

Whatever the direct inspiration for parameterized principles might have been, their 
adoption triggered an unprecedented explosion of work in comparative syntax. One un- 
questionably positive consequence of the P&P approach to linguistic theory was to spur 
investigation of a wide variety of languages, particularly those with structures markedly 
different from some of the more familiar Western ones. The explanation for this is 
straightforward. In earlier transformational grammar (oversimplifying somewhat), one 
worked on the grammar of English, the grammar of Thai, the grammar of Cherokee, and 
so on, and attempted to extract universal properties of grammars from the principles 
one found in common among these constructed grammars. But now the essential unity 
of all grammars, within the limits of parametric variation, was taken as a starting point. 
One could not even begin to address the grammar of some language without asking the 
question of how principles of Case, binding, bounding, and so on are parameterized in 
that language. And that in turn demanded that one have a rough feel for the degree of 
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parameterization possible for the principle. As Chomsky noted, to delimit the domain 
of core grammar, we “rely heavily on grammar-internal considerations and comparative 
evidence, that is, on the possibilities for constructing a reasonable theory of UG and con- 
sidering its explanatory power in a variety of language types, with an eye open to the 
eventual possibility of adducing evidence of other kinds” (Chomsky 1981: 9). 

The core idea of the P&P approach is that both the principles of UG and the possible 
parameter settings are part of our genetic endowment: 


[W]hat we “know innately” are the principles of the various subsystems of Sọ [= 
the initial state of the language faculty — FJN] and the manner of their interaction, 
and the parameters associated with these principles. What we learn are the values 
of these parameters and the elements of the periphery (along with the lexicon, to 
which similar considerations apply). The language that we then know is a system 
of principles with parameters fixed, along with a periphery of marked exceptions. 
(Chomsky 1986: 150-151) 


The original idea was that there are a small number of parameters and small number of 
settings. This idea allowed two birds to be killed with one stone. Parametric theory could 
explain the rapidity of acquisition, given the poor input, and explain the crosslinguistic 
distribution of grammatical elements. As Norbert Hornstein noted: 


The second reason in favor of parameter setting models has been their ability to 
provide (at least in principle) an answer to Plato's Problem [the fact that we know 
so much about language based on so little direct evidence - FJN]. The idea is that 
construing language acquisition as parameter setting eases the problem faced by 
the child, for setting parameter values is easier than learning the myriad possible 
rules of one's native language. In other words, the PLD [- Primary Linguistic Data 
- FJN] can be mined for parameter values more easily than it can be for rules. 
(Hornstein 2009: 165) 


The need to base a theory of parametric variation on the investigation ofa wide variety 
of languages resulted in what Bernard Comrie, always a major critic of the generative 
approach, referred to approvingly as “one of the most interesting recent developments 
in linguistic typology ... the entry of generative syntax into the field" (Comrie 1988: 458). 
Comparative studies of the distribution of null-subjects, binding domains, configura- 
tionality, and so on became routine by the 1980s and provided a generative interpreta- 
tion of the kind of crosslinguistic typological studies that were initiated by the work of 
Joseph Greenberg. In this regard, it is instructive to observe Chomsky's changing rhetor- 
ical evaluation of Greenbergian typological work. His first reference to Greenberg was 
somewhat dismissive, noting that "Insofar as attention is restricted to surface structures, 
the most that can be expected is the discovery of statistical tendencies, such as those 
presented by Greenberg (1963)" (Chomsky 1965: 118). In 1981, Chomsky offered what 
was perhaps his first favorable reference to this line of research: 


Universals of the sort explored by Joseph Greenberg and others have obvious rele- 
vance to determining just which properties of the lexicon have to be learned in this 
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manner in particular grammars — and to put it in other terms just how much has to 
be learned as grammar develops in the course of language acquisition. (Chomsky 
1981: 95) 


By 1982 he was writing that "Greenbergian universals ... are ultimately going to be 
very rich. ... They have all the difficulties that people know, they are "surfacy; they are 
statistical, and so on and so forth, but nevertheless they are very suggestive" (Chomsky 
1982: 111). And in 1986, they are "important, ... yielding many generalizations that require 
explanation ...” (Chomsky 1986: 21). 

In this paper, I do not question the fertility of the research program that was launched 
by the P&P approach. What I do is to provide a critical review of the various approaches 
that have been taken to parameters since the late 1970s, discussing their conceptual 
strengths and weaknesses. Given space limitations, my overview will in places be un- 
avoidably somewhat superficial. The paper is organized as follows. Sections 2 through 5 
outline various approaches that have been taken with respect to parameters: UG-principle- 
based, microparametric, macroparametric, and interface-based, respectively. Some of 
the major conceptual and empirical problems with the classical view of parameters are 
outlined in §6, and §7 discusses alternatives to the classical approach. §8 is a brief con- 
clusion. 


2 Parameterized UG principles 


All of the subsystems of principles in the Government-Binding Theory were assumed to 
be parameterized. Consider a few concrete examples: 


(1) Examples of parameterized UG principles: 

a. BINDING (Lasnik 1991). Principle C is parameterized to allow for sentences of 
the form john; thinks that John; is smart in languages like Thai and Vietnamese. 

b. GOVERNMENT (Manzini & Wexler 1987). The notion “Governing Category” 
is defined differently in different languages. 

c. BOUNDING (Rizzi 1982). In English, NP and S are bounding nodes for Subja- 
cency, in Italian NP and S’. 

d. X-BAR (“Origins of phrase structure”). In English, heads precede their comple- 
ments; in Japanese heads follow their complements. 

e. CASE and THETA-THEORY (Travis 1989). Some languages assign Case and/or 
Theta-roles to the left, some to the right. 


Fewer and fewer parameterized UG principles have been proposed in recent years 
for the simple reason that there are fewer and fewer widely accepted UG principles. 
The thrust of the Minimalist Program (MP) has been to reduce the narrow syntactic 
component and to reinterpret broad universal principles as economy effects of efficient 
computation. Economy principles are generally assumed not to be parameterized: 
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There is simply no way for principles of efficient computation to be parameter- 
ized [...], it strikes me as implausible to entertain the possibility that a principle 
like “Shortest Move” could be active in some languages, but not in others. Put 
differently, [...] there can be no parameters within the statements of the general 
principles that shape natural language syntax. (Boeckx 2011: 210) 


On the same page Boeckx proposes the “Strong Uniformity Thesis”: Principles of nar- 
row syntax are not subject to parameterization; nor are they affected by lexical parame- 
ters. 

It should be noted that the very idea of looking for principles of UG has fallen into 
disrepute in recent years. For example, Chomsky has attributed to them what can only 
be described as negative qualities: 


[T]ake the LCA (Linear Correspondence Axiom) [Kayne 1994]. If that theory is 
true, then the phrase structure is just more complicated. Suppose you find out that 
government is really an operative property. Then the theory is more complicated. 
If ECP really works, well, too bad; language is more like the spine [i.e., poorly 
designed - FJN] than like a snowflake [i.e., optimally designed]. (Chomsky 2002: 
136) 


So if in theory there are very few UG principles and no parameters associated with 
them, then the question is where and how to capture systematic crosslinguistic variation. 
Given the organization of grammars in a P&P-type model, the simplest assumption to 
make is that one group of languages contains a particular feature attraction mechanism 
that another group lacks, thus allowing the presence or absence of this mechanism to 
divide languages into typological classes. Some early examples can be illustrated by 
whether or not a feature setting determines whether V moves to I in a particular language 
(Emonds 1978; Pollock 1989), whether V moves to C (to derive V2 order) (Besten 1977), 
and whether N incorporates into V (Baker 1988). 

A major debate within parametric theory has centered on the host of the attracting 
feature. In “microparametric” approaches, the locus of variation lies in individual func- 
tional heads. “Macroparametric” approaches are not so restricted. They will be discussed 
in §3 and §4 respectively. 


3 The Borer-Chomsky Conjecture and microparametric 
approaches 


Hagit Borer, in Parametric Syntax (Borer 1984), made two proposals, which she may or 
may not have regarded as variants of each other. One is that parameters are restricted 
to the idiosyncratic properties of lexical items, the other that they are restricted to the 
inflectional system. Borer wrote: 


..interlanguage variation would be restricted to the idiosyncratic properties of lexi- 
cal items. These idiosyncrasies, which are clearly learned, would then interact with 
general principles of UG in a particular way. (Borer 1984: 2-3) 
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By way of example, she discussed a rule that inserts a preposition in Lebanese Arabic 
- a rule that does not exist in Hebrew: 


(2): gas > la / [pp ... NP] 


Along the same lines, Manzini & Wexler (1987) pointed to language-particular anaph- 
ors that have to be associated with parameters: cakicasin and caki in Korean; sig and 
hann in Icelandic. 

Now every language has thousands of lexical items, but nobody ever entertained the 
possibility that every lexical item might be a locus for parametric variation. Borer's 
proposal that only inflectional elements provide the locus for parametric variation was 
designed to forestall this possibility. In the same book she wrote: 


It is worth concluding this chapter by reiterating the conceptual advantage that 
reduced all interlanguage variation to the properties of the inflectional system. The 
inventory of inflectional rules and of grammatical formatives in any given language 
is idiosyncratic and learned on the basis of input data. (Borer 1984: 29) 


The restriction of parameters to the inflectional system is a somewhat different pro- 
posal than their restriction to lexical items. After all, not all lexical items are part of the 
inflectional system and not all inflections are lexical. However, Borer took "inflectional" 
in a pretty broad sense, namely, to encompass Case and agreement relations, theta-role 
assignment, and so on. She recognized an immediate problem here: Inflection-based pa- 
rameters could not handle some of the best known cases of crosslinguistic variation such 
as differences in head-order and extraction possibilities. 

In any event, the hypothesis that the locus of parametric variation is restricted to 
exclude major lexical categories came to be known as the “Borer-Chomsky Conjecture”. 

Borer (1984) appeared before the distinction between lexical and functional categories 
had been elaborated. Once this distinction had become well accepted, it seemed natural 
to associate parameters with functional heads, rather than with inflectional items. This 
idea was first proposed as The Functional Parameterization Hypothesis (FPH) in Fukui 
(1988). In this view, only functional elements in the lexicon (that is, elements such as 
Complementizer, Agreement, Tense, etc.) are subject to parametric variation. 

It is important to stress that FPH is not a simple extension of the idea that parameters 
are inflection-located. There have been countless functional categories proposed that 
have nothing to do with inflection, no matter how broadly this concept is interpreted. 
So adverbs, topic, focus, and so on are typically thought to be housed in functional cate- 
gories, even though they are not in many languages “inflectional”. 

Associating parameters with functional heads has been claimed to have both method- 
ological and theoretical advantages. Methodologically, it allows “experiments” to be 
constructed comparing two closely-related variants, thereby pinpointing the possible 
degree of variation. The ideal situation then would be to compare speech varieties that 
differ from each other only in terms of (most ideally) one or, failing that, only a few 
variables. Richard Kayne remarks: 


! Fukui himself exempted ordering restrictions from this hypothesis. 
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If it were possible to experiment on languages, a syntactician would construct an 
experiment of the following type: take a language, alter a single one of its observ- 
able syntactic properties, examine the result and see what, if any, other property 
has changed as a consequence. If some property has changed, conclude that it and 
the property that was altered are linked to one another by some abstract parameter. 
Although such experiments cannot be performed, I think that by examining pairs 
(and larger sets) of ever more closely related languages, one can begin to approxi- 
mate the results of such an experiment. To the extent that one can find languages 
that are syntactically extremely similar to one another, yet clearly distinguishable 
and readily examinable, one can hope to reach a point such that the number of ob- 
servable differences is so small that one can virtually see one property covarying 
with another. (Kayne 2000: 5-6) 


In other words, in Kayne’s view, this “microparametric variation” (as he called it) is 
the best testing ground for the hypothesis that syntactic variation can be reduced to a 
finite set of parameters. 

Along more theoretical lines, it has been claimed that functional-category-situated 
microparameters impose a strong limit on what can vary, crosslinguistic differences 
now being reduced to differences in features, thereby restricting learning to the lexi- 
con (Kayne 2000; Roberts 2010; Thornton & Crain 2013)? Indeed, Chomsky has often 
asserted that microparameters are necessary in order to solve Plato’s Problem: 


Apart from lexicon, [the set of possible human languages] is a finite set, surpris- 
ingly; in fact, a one-membered set if parameters are in fact reducible to lexical 
properties [associated with functional categories — FJN] ... How else could Plato's 
problem be resolved? (Chomsky 1991: 26) 


4 Macroparameters 


Not all minimalists have embraced the Borer-Chomsky Conjecture and consequent turn 
to microparameters. Mark Baker, in particular, while not denying that there are micro- 
level points of variation between languages, has defended what he calls “macroparam- 
eters” (Baker 1996), that is, parametric differences that cannot be localized in simple 
differences in attracting features of individual functional heads. He gives as examples, 
among others, the Head Directionality Parameter (i.e. VO vs. OV), where functional cat- 
egories play no obvious role, the Polysynthesis Parameter, which in his account refers 
to the lexical category “Verb”, and an agreement parameter (Baker 2008) distinguishing 
Niger-Congo languages from Indo-European languages, which, in opposition to a strong 
interpretation of the Borer-Chomsky Conjecture, applies to the full range of functional 
categories. Another example of a macroparameter is the compounding parameter of Sny- 
der (2001: 328), which divides languages into those that allow formation of endocentric 


? As an anonymous reviewer points out, this claim is highly dependent on the nature of the features and the 
role that they play in the system. 
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compounds during the syntactic derivation and those that do not. The NP/DP macropa- 
rameter of Bošković & Gajewski (2011) distinguishes "NP languages”, which lack articles, 
permit left-branch extraction and scrambling, but disallow NEG-raising, from “DP lan- 
guages”, which can have articles, disallow left-branch extraction and scrambling, but 
allow NEG-raising. And Huang (2007) points to many features that distinguish Chinese- 
type languages from English-type languages, including a generalized classifier system, 
no plural morphology, extensive use of light verbs, no agreement, tense, or case morph- 
ology, no overt wh-movement, and radical pro-drop. 

Baker and other advocates of macroparameters share the conviction long held by ad- 
vocates of holistic typology that languages can be partitioned into macro-scope broad 
classes, typically (or, at least, ideally) where the setting of one feature entails a cascade of 
shared typological properties. As Baker puts it, “the macroparametric view is that there 
are at least a few simple (not composite) parameters that define typologically distinct 
sorts of languages” (Baker 2008: 355). 


5 Parameters as being stated at the interfaces 


Under the perspective that parameters are stated at the interfaces, lexical items are sub- 
ject to a process of generalized late insertion of semantic, formal, and morphophono- 
logical features after the syntax, which is where all variation would take place. Or, as 
another possibility, the parametric differences would derive from the way in which such 
features are interpreted by the interfaces or by processes that manipulate the features 
on the path from spell out to the interfaces. There has been some debate as to whether 
there is parametric variation at the Conceptual-Intentional (C-I) interface. Angel Gallego 
remarks: 


... it would be odd for semantic features to be a source of variation, which leaves us 
with formal and morphophonological features as more likely suspects. ... Consid- 
ered together, the observations by Chomsky (2001) and Kayne (2005; 2008) appear 
to place variation in the morphophonological manifestation of closed classes (i.e. 


” 


functional categories, which contain unvalued features)? (Gallego 2011: 543-544) 


However, for Ramchand & Svenonius (2008) the narrow syntax provides a “basic skele- 
ton” to C-I, but languages vary in terms of how much their lexical items explicitly encode 
about the reference of variables like T, Asp, and D. 


6 Conceptual and empirical problems with parameters 
Before moving on to nonparametric approaches to variation, it would be useful to high- 


light some of the main problems with the classic view of parameters as being innately- 
provided grammatical constructs (for an earlier discussion, see Newmeyer 2005). 
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6.1 No macroparameter has come close to working 


The promise of parameters in general and macroparameters in particular is that from 
the interaction of a small number of simply-stated parameters, the vast complexity of 
human language morphosyntax might be derived. As Martin Haspelmath put it: 


According to the principles and parameters vision, it should be possible at some 
point to describe the syntax of a language by simply specifying the settings of all 
syntactic parameters of Universal Grammar. We would no longer have any need 
for thick books with titles like The Syntax of Haida (cf. Enrico 2003’s 1300-page 
work), and instead we would have a simple two-column table with the parame- 
ters in the first column and the positive or negative settings in the second column. 
(Haspelmath 2008: 80) 


Needless to say, nothing remotely like that has been achieved. The problem is that “few 
of the implicational statements at the heart of the traditional Principles-and-Parameters 
approach have stood the test of time” (Boeckx 2011: 216). The clustering effects are simply 
not very robust. The two most-studied macroparameters, I believe, are the Null Subject 
(Pro-drop) and the Subjacency parameters, neither of which is much evoked in recent 
work. As for the former: “History has not been kind to to the Pro-drop Parameter as orig- 
inally stated” (Baker 2008: 352). And Luigi Rizzi notes that “In retrospect, [subjacency 
effects] turned out to be a rather peripheral kind of variation. Judgments are complex, 
graded, affected by many factors, difficult to compare across languages, and in fact this 
kind of variation is not easily amenable to the general format of parameters ..? (Rizzi 
2014: 16). 


6.2 “Microparameter” is just another word for “language-particular 
rule” 


Let’s say that we observe two Italian dialects, one with a do-support-like structure and 
one without. We could posit a microparametric difference between the dialects, perhaps 
hypothesizing that one contains an attracting feature that leads to do-support and one 
that does not. But how would such an hypothesis differ in substance from saying that 
one dialect has a rule of do-support that the other one lacks? Indeed, Norbert Hornstein 
has stressed that “microparameter” is just another words for "rule"? 
Last of all, if parameters are stated in the lexicon (the current view), then para- 
metric differences reduce to whether a given language contains a certain lexical 
item or not. As the lexicon is quite open ended, even concerning functional items 
as a glance at current cartographic work makes clear, the range of variation be- 
tween grammars/languages is also open ended. In this regard it is not different 
from a rule-based approach in that both countenance the possibility that there is 
no bound on the possible differences between languages. (Hornstein 2009: 165) 


3 See Rizzi (2014: 22-27) for a defense of the idea that microparameters are not merely rules under a different 
name and Boeckx (2014) for a reply to Rizzi. 
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Michal Starke has made a similar observation: 


Thirty years ago, if some element moved in one language but not in another, a 
movement rule would be added to one language but not to the other. Today, a 
feature “I want to move" (“EPP’, “strength’, etc.) is added to the elements of one 
language but not of the other. In both cases (and in all attempts between them), 
variation is expressed by stipulating it. Instead of a theory, we have brute force 
markers. (Starke 2014: 140) 


6.3 There would have to be hundreds, if not thousands, of parameters 


Tying parameters to functional categories was a strong conjecture in the 1980s, since 
there were so few generally recognized functional categories at the time. There were so 
few, in fact, that it was easy to believe that only a small number of parameters would 
be needed. Pinker (1994: 112), for example, speculated that there are just “a few men- 
tal switches’. Lightfoot (1999: 259) suggested that there are about 30 to 40 parameters. 
For Adger (2003: 16), "Ihere are only a few parameters’. Roberts & Holmberg (2005) in- 
creased the presumed total to between 50 and 100. Fodor (2001: 734) was certainly correct 
when she observed that “it is standardly assumed that there are fewer parameters than 
there are possible rules in a rule-based framework; otherwise, it would be less obvious 
that the amount of learning to be done is reduced in a parametric framework’. At this 
point in time, many hundreds of parameters have been proposed. Gianollo, Guardiano 
& Longobardi (2008) propose 47 parameters for DP alone on the basis of 24 languages, 
only five of which are non-Indo-European, and in total representing only 3 families. Lon- 
gobardi & Guardiano (2011) up the total to 63 binary parameters in DP. As Cedric Boeckx 
has stressed: “It is not at all clear that the exponential growth of parameters that syntac- 
ticians are willing to entertain is so much better a situation for the learner than a model 
without parameters at all” (Boeckx 2014: 157). 

One way to circumvent this problem would be to posit nonparametric differences 
among languages, thereby maintaining the possibility of a small number of parameters. 
Let us examine this idea now. 


6.4 Nonparametric differences among languages undercut the entire 
parametric program 


Are all morphosyntactic differences among languages due to differences in parameter 
setting? Generally that has been assumed not to be the case. Charles Yang was expressing 
mainstream opinion when he wrote that ^... it seems highly unlikely that all possibilities 
of language variation are innately specified ..? (Yang 2011: 191). From the beginning of 
the parameteric program it has been assumed that some features are extraparametric. 
Outside of (parametrically relevant) core grammar are: 


... borrowings, historical residues, inventions, and so on, which we can hardly ex- 
pect to - and indeed would not want to - incorporate within a principled theory 
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of UG. ... How do we delimit the domain of core grammar as distinct from marked 
periphery? ... [We] rely heavily on grammar-internal considerations and compar- 
ative evidence, that is, on the possibilities for constructing a reasonable theory 
of UG and considering its explanatory potential in a variety of language types ... 
(Chomsky 1981: 8-9) 


In other words, some language-particular features are products of extraparametric 
language-particular rules. Consider, for example, the treatment of Hixkaryana in Baker 
(2001), based on an earlier proposal in Kayne (1994). This language for the most part 
manifests OVS word order: 


(3) Hixkaryana (Derbyshire 1985) 
Kanawa yano toto 
canoe took person 


"[he man took the canoe: 


One's first thought might be that what is needed is a parameter allowing for OVS 
order. But in fact Baker rejects the idea that a special word order parameter is involved 
here. Rather, he argues that Hixkaryana is (parametrically) SOV and allows the fronting 
of VP by a movement rule: 


(4) S[OV] — [OV]S 


In other words, in this account word order is determined both by a parameter and a 
language-specific rule. 

It is quite implausible that every syntactic difference between languages and dialects 
results from a difference in parameter settings. Consider the fact that there are several 
dozen systematic morphosyntactic differences between the Norfolk dialect and standard 
British English (Trudgill 2003), most of which appear to be analytically independent. If 
each were to be handled by a difference in parameter setting, then, extrapolating to all 
of the syntactic distinctions in the world's languages, there would have to be thousands 
- if not millions — of parameters. That is obviously an unacceptable conclusion from 
an evolutionary standpoint, given that the set of parameters and their possible settings 
is, by hypothesis, innate. Furthermore, many processes that can hardly be described as 
"marginal" have been assumed to apply in PF syntax (where the standard view, I believe, 
is that parameters are not at work), including extraposition and scrambling (Chomsky 
1995); object shift (Holmberg 1999; Erteschik-Shir 2005); head movements (Boeckx & 
Stjepanovic 2001); the movement deriving V2 order (Chomsky 2001); linearization (i.e. 
VO vs. OV) (Chomsky 1995; Takano 1996; Fukui & Takano 1998; Uriagereka 1999); and 
even Wh-movement (Erteschik-Shir 2005). 

Ithink that it is fair to say that, after 35 years of investigation, nobody has a clear idea 
about which syntactic differences should be considered parametric and which should 
not be.* But one thing seems clear: If learners need to learn rules anyway, very little is 
gained by positing parameters. 


^ But see Smith & Law (2009) for an interesting discussion of criteria for distinguishing parametric and 
nonparametric differences. 
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6.5 Parametric theory is arguably inherently unminimalist 


There are a number of ways that the assumptions of the Minimalist Program have en- 
tailed a rethinking of parameters and the division of labor among the various compo- 
nents for the handling of variation. In one well-known formulation, “FLN [= the faculty 
of language in the narrow sense - FJN] comprises only the core computational mecha- 
nisms of recursion as they appear in narrow syntax and the mapping to the interfaces' 
(Hauser, Chomsky & Fitch 2002: 1573). Where might parameters fit into such a scenario? 
In one view, “... if minimalists are right, there cannot be any parameterized principles, 
and the notion of parametric variation must be rethought. (Boeckx 2011: 206). That is, 
given that the main thrust of the minimalist program is the reduction to the greatest 
extent possible of the elements of UG, there would seem to be no place for innately- 
specified parameters. 

Despite the above, a great deal of work within the general envelope of the MP is still 
devoted to fleshing out parameters, whether micro or macro. For example, Yang (2011: 
202-203) writes that “a finite space of parameters or constraints is still our best bet on 
the logical problem of language acquisition’. Note also that in many approaches, “the 
mapping to the interfaces' encompasses a wide variety of operations. To give one exam- 
ple, “UG makes available a set F of features (linguistic properties) and operations Cyr, 
... that access F to generate expressions" (Chomsky 2000: 100). In addition to features 
and the relevant operations on them, minimalists have attributed to the narrow syntax 
principles governing agreement, labelling, transfer, probes, goals, deletion, and econ- 
omy principles such as Last Resort, Relativized Minimality (or Minimize Chain Links), 
and Anti-Locality. None of these fall out from recursion per se, but rather represent con- 
ditions that underlie it or that need to be imposed on it. To that we can add the entire set 
of mechanisms pertaining to phases, including what nodes count for phasehood and the 
various conditions that need to be imposed on their functioning, like the Phase Impen- 
etrability Condition. And then there is the categorial inventory (lexical and functional), 
as well as the formal features they manifest. The question, still unresolved, is whether 
any of these principles, conditions, and substantive universals could be parameterized, 
in violation of the Strong Uniformity Thesis, but not of weaker proposals. If so, that 
would seem to allow for parametric variation to be manifested in the journey towards 
the interfaces. 


7 Alternatives to the classic Principles-and-Parameters 
model 
Chomsky (2005) refers to “the three factors in language design’, namely, genetic endow- 


ment, experience, and principles not specific to the faculty of language. The last-named 
"third factor explanations’, which include principles of data analysis and efficient com- 
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putation, among other things, provide a potential alternative to the nonminimalist polif- 
eration of parameters and their settings." 

The following subsections discuss alternatives to the classic P&P model, all appealing 
to one degree or another to third factor explanations. They are grounded approaches 
(87.1), epigenetic (or emergentist) approaches (87.2), and reductionist approaches (87.3). 


7.1 Grounded approaches 


A grounded approach is one in which some principle of UG is grounded in - that is, 
ultimately derived from - some third factor principle. Along these lines, a long tradition 
points to a particular constraint, often an island constraint, and posits that it is a gram- 
maticalized processing principle. One of the first publications to argue for grounded 
constraints was Fodor (1978), where two island constraints are posited, one of which is 
the Nested Dependency Constraint (NDC): 


(5) The Nested Dependency Constraint: If there are two or more filler-gap 
dependencies in the same sentence, their scopes may not intersect if either 
disjoint or nested dependencies are compatible with the well-formedness 
conditions of the language. 


As Fodor noted, the processing-based origins of this constraint seem quite straightfor- 
ward. 

Another example is the Final Over Final Constraint (FOFC), proposed originally in 
Holmberg (2000): 


(6) Final Over Final Constraint: If œ is a head-initial phrase and f is a phrase 
immediately dominating a, then D must be head-initial. If « is a head-final 
phrase, and f is a phrase immediately dominating o, then f can be head-initial or 


head-final. 


As one consequence of the FOFC, there are COMP-TP languages that are verb-final, 
but there are no TP-COMP languages that are verb-initial. Holmberg and his colleagues 
interpret this constraint as the following UG principle: 


(7) A theoretical reinterpretation of the FOFC: If a phase-head PH has an 
EPP-feature, then all the heads in its complement domain from which it is 
nondistinct in categorial features must have an EPP-feature. (Biberauer, 
Holmberg & Roberts 2008: 13) 


Walkden (2009) points out that FOFC effects are accounted for by the processing the- 
ory developed in Hawkins (2004) and hence suggests that (7) is a good example of a 
grounded UG principle. 


5 In what follows, I consider classical functional explanations of grammatical structure to be of the third 
factor type. It is not clear whether Chomsky shares that view. 
D Mobbs (2014) builds practically all of Hawkins’s parsing theory into UG. 
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Note that neither Fodor nor Walkden have reduced the number of UG constraints; 
they have merely attributed the origins of these constraints to what in Chomsky’s ac- 
count would be deemeed a third factor. Naturally, the question arises as to whether 
these principles would need to be parameterized. The answer is “apparently so’, since 
the NDC does not govern Swedish grammar (Engdahl 1985: 75) and the FOFC is not at 
work in Chinese (Chan 2013). In other words, grounded approaches, whatever intrinsic 
interest they might possess, do not prima facie reduce the number of UG principles and 
parameters. 


7.2 Epigenetic approaches 


Let us turn now to “epigenetic’ or “emergentist” approaches to variation, where param- 
eters are not provided by an innate UG. Rather, parametric effects arise in the course 
of the acquisition process through the interaction of certain third factor learning biases 
and experience. UG creates the space for parametric variation by leaving certain features 
underspecified. There are several proposals along these lines, among which are Gianollo, 
Guardiano & Longobardi (2008); Boeckx (2011); and Biberauer et al. (2014) (preceded by 
many papers by the same four authors). For reasons of space, I focus exclusively on 
Biberauer et al. (2014). In their way of looking at things, the child is conservative in the 
complexity of the formal features that it assumes are needed (what they call “feature 
economy’) and liberal in its preference for particular features to extend beyond the in- 
put (what they call “input generalization” and which is a form of the superset bias). The 
idea is that these principles drive acquisition and thus render parameters unnecessary, 
while deriving the same effects. Consider first their word order hierarchy, represented 
in Figure 1: 


a. Is head-final present? 


Ecc MEG 


No: head-initial b. Yes: Present on all heads? 


M dlc es 


Yes: head-final No: present on [+V] heads? 


Rm c c 


Yes: head-final in the clause only d. No: present on ... 
Figure 1: The Word Order Hierarchy of Biberauer et al. (2014: 110) 


To illustrate with a made up example, let's say that a language is consistently head- 
initial except in NP, where the noun follows its complements. However, there is a defin- 
able class of nouns in this language that do precede their complements and a few nouns 
in this language behave idiosyncratically in terms of the positioning of their specifiers 
and complements (much like the English word enough, which is one of the few degree 
modifiers that follows the adjective). In their theory, the child will go through the fol- 
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lowing stages of acquisition, zeroing in step-by-step on the adult grammar. First it will 
assume that all phrases are head-initial, even noun phrases. Second, it will assume that 
all NPs are head-final. Third, it will learn the systematic class of exceptions to the latter 
generalization, and finally, it will learn the purely idiosyncratic exceptions. 

The other hierarchies proposed in Biberauer et al. are more complex and depend on 
many assumptions about the feature content of particular categories. Consider for exam- 
ple their null argument hierarchy and the questions posed by the child in determining 
the status of such arguments in its grammar (Figure 2). 


a. Are ug-features present 
on probes? 


Ec o m 


No: Yes: 
Radical pro-drop Are ug-features present 
on all probes? 


Yes: No: 
Pronominal arguments Are ug-features fully specified 
on some probes? 


ee 


No: Yes: 
Non-prodrop Are ug-features fully specified 
on T? 
Yes: No: 


Consistent null subject 
Figure 2: The Null Argument Hierarchy of Biberauer et al. (2014: 112) 


There are many issues that one might raise about this acquisition scenario, the most 
important of which being whether children proceed from general to the particular, cor- 
recting themselves as they go, and gradually zero in on the correct grammar. Indeed, 
many acquisitionists argue for precisely the reverse set of steps, in which children learn 
idiosyncratic items before broad generalizations. Theresa Biberauer herself acknowl- 
edges (p.c., November 21, 2014) that these steps can be overridden in certain circum- 
stances. For example, the early stages might correspond to a pre-production stage or 
possibly acquirers pass through certain stages very quickly if counterevidence to an ear- 
lier hypothesis is readily available. And finally, as they note, frequency effects can distort 
the smooth transition through the hierarchies. 

One might also ask how much the child has to know in advance of the progression 
through the hierarchy. For example, given their scenario, the child has to know to ask 
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“Are unmarked phi-features fully specified on some probes?” That implies a lot of gram- 
matical knowledge. Where, one might ask, does this knowledge come from and how 
does the child match this knowledge with the input?" 

Despite these unresolved questions, the Biberauer et al. approach presents a view that 
preserves the major insights of parametric theory without positing UG-based parame- 
ters. As such, it needs to be taken very seriously. 


7.3 Reductionist approaches and the need for language-particular 
rules 


Reductionist approaches differ from epigenetic ones by reducing still further the role 
played by an innate UG in determining crosslinguistic variation. For example, returning 
to the FOFC, Trotzke, Bader & Frazier (2013) provide evidence that the best motivated 
account is to remove it entirely from the grammar, since, in their view, it can be explained 
in its entirety by systematic properties of performance systems. They also deconstruct 
the Head-Complement parameter in a similar fashion: 


[T]he physics of speech, that is, the nature of the articulatory and perceptual 
apparatus requires one of the two logical orders, since pronouncing or perceiv- 
ing the head and the complement simultaneously is impossible. Thus, the head- 
complement parameter, according to this approach, is a third-factor effect. (Trotzke, 
Bader & Frazier 2013: 4) 


Which option is chosen, of course, has to be built into the grammar of individual 
languages, presumably via its statement as a language-particular rule. 

To take another example, Kayne (1994) provided an elaborate UG-based parametric 
explanation of why rightward movement is so restricted in language after language. But 
Ackema & Neeleman (2002) argue that the apparent ungrammaticality of certain “right- 
displaced” syntactic structures should not be accounted for by syntax proper (that is, by 
the theory of competence), but rather by the theory of performance. In a nutshell, such 
structures are difficult to process. A necessary consequence of their approach is that it 
is necessary to appeal to language-particular rules to account for the fact that languages 
differ from each other in the degree to which displacement to the right is permitted. 

The microparametric approach to variation is well designed to capture the fact that 
even closely related speech varieties can vary from each other in many details. The 
question is why one would want to appeal to microparameters when the traditional 
term “rule” seems totally appropriate (see §6.2). The resistance to the idea of reviving 
the idea of (language-particular) rules is unsettling to some, perhaps because the idea of 
“rules” brings back the ghosts of pre-generative structuralism, where it was believed by 


7 An anonymous referee asks: “To be honest, I don’t see how the approach sketched here is different from 
parameter setting. Perhaps it’s my own ignorance of Biberauer et als proposal, but if the hierarchy of 
questions that the learner must address is innate, how does this differ from parameters that are innately 
specified?" As I understand their proposal, the hierarchy of questions falls out from general learning 
principles, though I am hazy on the details of precisely how. 
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some that “languages could differ from each other without limit” (Joos 1957: 96), and the 
spectre of early transformational grammar, where grammars were essentially long lists 
of rules. But to call a language-particular statement a “rule” is not to imply that anything 
can be a rule. Possible rules are still constrained by UG. That of course raises the question 
of what is in UG. An obvious candidate is the Merge operation or something analogous, 
but surely there must be a lot more than that. For example, it is hard to see how the 
broad architecture of the grammar could be learned inductively. Consider the fact that 
syntactic operations have no access to the segmental phonology: There is no language 
in which displacement - Internal Merge, if you will — targets only those elements with 
front vowels. It seems probable that this state of affairs derives from UG. 

However, if the general thrust of the work of John A. Hawkins is correct (see Hawkins 
1994; 2004; 2014), the major constraints on the nature of rules derive from the exigencies 
of language processing. No language has a rule that lowers a filler exactly two clauses 
deep, leaving a gap in initial position. Such a rule, while theoretically possible, is so im- 
probable (for processing reasons) that it will never occur. Norbert Hornstein's approach 
to variation, succinctly stated in the following passage, also stresses that it is not neces- 
sary to appeal to UG to explain why certain logically possible properties of grammars 
do not occur: 


There is no upper bound on the ways that languages might differ though there are 
still some things that grammars cannot do. A possible analogy for this conception 
of grammar is the variety of geometrical figures that can be drawn using a straight 
edge and compass. There is no upper bound on the number of possible different 
figures. However, there are many figures that cannot be drawn (e.g. there will be 
no triangles with 20 degree angles). Similarly, languages may contain arbitrarily 
many different kinds of rules depending on the PLD [ = primary linguistic data - 
FJN] they are trying to fit. However, none will involve binding relations in which 
antecedents are c-commanded by their anaphoric dependents or where questions 
are formed by lowering a Wh-element to a lower CP. Note that this view is not in- 
compatible with languages differing from one another in various ways. (Hornstein 
2009: 167) 


In my view, the idea that a grammar is composed of language-particular rules con- 
strained by both UG principles and third factor principles is an appealing vision that 
stands to inform research on crosslinguistic variation in the years to come. 


8 Conclusion 


Since the late 1970s, crosslinguistic variation has generally been handled by means of 
UG-specified parameters. On the positive side, thinking of variation in terms of pa- 
rameterized principles unleashed an unprecedented amount of work in comparative 
syntax, leading to the discovery of heretofore unknown morphosyntactic phenomena 
and crosslinguistic generalizations pertaining to them. On the negative side, however, 
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both macroparameters and microparameters have proven themselves to be empirically 
inadequate and conceptually nonminimalist. Alternatives to parameters are grounded 
approaches, epigenetic approaches, and reductionist approaches, the last two of which 
seem both empirically and conceptually quite promising. 
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Saussure’s Dilemma: Parole and its 
potential 
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Columbia University 


Saussure’s account of the transformation of Latin to French stress leads to the unintended 
conclusion that parole has a life of its own: parole persists even after it is no longer dictated 
by langue; parole can prevent change or, conversely, presage potential change. Saussure’s 
example is paralleled by intrusive r (Cuba[r]against your friends, not "day[r]and) and the 
competition of napron and its near twin, and eventual successor, apron. Parole lives. 


1 Saussure’s Dilemma 


It seems only fitting to begin this tribute to Steve Anderson, friend and erstwhile UCLA 
colleague, historian of linguistics and confirmed helvétophile, with Ferdinand de Saus- 
sure and his discussion of the history of stress from Latin to French (Chapter III.4-6 
of the Course). That history presents a dilemma for Saussure’s separation of synchrony 
from diachrony and linguistic activity (parole) from system (systéme, langue). 

Here is Saussure’s account of the transition from (ante)penultimate stress in Latin to 
final stress in French: 


In French, the accent always falls on the last syllable unless this syllable contains 
a mute e (a). This is a synchronic fact, a relation between the whole set of French 
words and accent. What is its source? A previous state. Latin had a different and 
more complicated system of accentuation: the accent was on the penultimate syl- 
lable when the latter was long; when short, the accent fell back on the antepenult 
(cf. amicus, anima). The Latin law suggests relations that are in no way analogous 
to the French law. Doubtless the accent is the same in the sense that it remained 
in the same position; in French words it always falls on the syllable that had it in 
Latin: amicum — ami, ánimam —4me. But the two formulas are different for the 
two moments because the forms of the words changed. We know that everything 
after the accent either disappeared or was reduced to mute e. As a result of the 
alteration of the word, the position of the accent with respect to the whole was 
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no longer the same; subsequently speakers, conscious of the new relation, instinc- 
tively put the accent on the last syllable, even in borrowed words introduced in 
their written forms (facile, consul, ticket, burgrave, etc.). Speakers obviously did 
not try to change systems, to apply a new formula, since in words like amicum 
— ami, the accent always remained on the same syllable; but speakers changed 
the position of the accent without having a hand in it. A law of accentuation, like 
everything that pertains to the linguistic system, is an arrangement of terms, a 
fortuitous and involuntary result of evolution. (Saussure 1959: 86) 


To explain modern French final stress, Saussure goes back to its source in Latin, or 
rather to a stage of Romance subsequent to classical Latin but much earlier than modern 
French. He first defines the stress rules for French (final) and Latin ((ante)penultimate, 
depending on the quantity of the penult). That done, Saussure mentions in passing that 
the position of stress in French preserves the original position of stress from Latin, and 
illustrates the claim with examples of the two subcases of Latin stress, on a long penul- 
timate (amicum — ami) and on the antepenultimate when the penult is short (ánimam 
—áme). It is worth mentioning that his formulation “the accent is the same in the sense 
that it remained in the same position" is not original; it repeats a standard observation 
from French philology in the middle of the nineteenth century. Thus in 1862 Gaston 
Paris (and seven other scholars he mentions, p. 11) stated the observation that stress falls 
on the same syllable in French as it did in Latin; for that phenomenon Paris in particular 
uses the apt term "persistence" (persistance) (Paris 1862: 28). Saussure does not use that 
term, but his formulation echoes this earlier tradition. Saussure then mentions the famil- 
iar fact that syllables after the syllable with the “persistent” stress are subject to stress 
reduction. At this point one might think he is preparing to explain how final stress in 
French arose, for example, perhaps by generalizing the word-final stress of words like 
amicum — ami that had undergone apocope. Saussure does not go in that direction. 
Instead, he declines to give a linguistic explanation for the development of final stress 
and places the burden on speakers acting "instinctively" and then on the vague asser- 
tion that “a diachronic fact was interposed.” So although Saussure at the outset seemed 
prepared to explain modern French stress in terms of its source in Latin, the source is 
not relevant to Saussure’s interpretation. He ends with a summary blaming the chaotic 
nature of change: “A law of accentuation, like everything that pertains to the linguistic 
system, is an arrangement of terms, a fortuitous and involuntary result of evolution” 

Saussure seemed to recognize that the Latin rule of (ante)penultimate stress was lost 
as a rule. Further, it cannot have been a rule of the systéme; had it been rule of the 
systéme, it would have been maintained. Saussure’s response was this: 


The synchronic law is general but not imperative. Doubtless it is imposed on in- 
dividuals by the weight of collective usage..., but here I do not have in mind an 
obligation on the part of speakers. I mean that in language no force guarantees 
the maintenance of a regularity when established on some point. Being a simple 
expression of an existing arrangement, the synchronic law reports a state of affairs; 
it is like a law that states that trees in a certain orchard are arranged in the shape 
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of a quincunx. And the arrangement that the law defines is precarious precisely 
because it is not imperative. Nothing is more regular than the synchronic law that 
governs Latin accentuation...; but the accentual rule did not resist the forces of 
alteration and gave way to a new law, the one of French... In short, if one speaks 
of law in synchrony, it is in the sense of an arrangement, a principle of regularity. 
(Saussure 1959: 92-93) 


This analysis, developed in connection with a discussion of six facts of Indo-European, 
is here applied to the Latin stress rule, which was downgraded to a descriptive observa- 
tion about behavior maintained by social convention, analogous to the rule stating that 
"the trees in a certain orchard are arranged in the shape of a quincunx? 

The phenomenon of Latin stress, with its properties of persistence, precariousness, 
and regularity presented a dilemma for Saussure. The phenomenon of persistence im- 
plies that usage (parole) has a life of its own; parole is maintained as parole, as habit, 
transmitted by imitation from one generation to the next. To invoke a metaphor, substi- 
tuting “language” for “body” in Newton’s first law, we could say: “a language at rest 
remains at rest unless it is acted on by an external force? Usage, such as the Latin 
stress rule, can exhibit coherent patterns (such as the elegant parallelism of length of 
the penultimate and antepenultimate position). Moreover, the patterns of parole are ca- 
pable of defining the conditions for change (such as the "persistent" location of stress 
after Latin which conditions post-tonic apocope), and in this way patterns of parole act 
like elements of systéme. 

Saussure's dilemma was that the more he insisted on the dominant, special, pure ex- 
clusionary status of systéme, the more ethereal and abstract system became, and that had 
the paradoxical effect of elevating parole to be the object of investigation. 


2 “Law and Order" and sandhi doublets 


To document the fate of /r/ in weak position (after a vowel, not before a vowel), Ku- 
rath & McDavid (1961) divided the eastern seaboard into four discontinuous zones, two 
northern and two southern, in which /r/ becomes [9] in weak position: northern - in- 
cluding New England (Connecticut River east) and metropolitan New York; southern - 
including Upper South (Virginia, into northern North Carolina) and Lower South (South 
Carolina, Georgia). The four zones are separated by transitional belts which retain some 
form of r (presumably American [1] or *velarized constricted” r [x]). The largest of the 
transitional belts is Pennsylvania, from which rhoticism spread to Midwest and Midland 
dialects. 

The discussion here focuses on the northern zones, which Kurath & McDavid (1961) 
treated as a single zone. As shown in Table 1, in the north /r/ is reflected in weak position 
as [2] after mid and high vowels ([i, u, e, o]). After low vowels ([a/e, 0/9] and here [a] as 
well) what must have been the earlier reflex [2] was lost or absorbed by the vowel. 
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Table 1: Postvocalic rhotic reflexes, North 


high/mid V low V 

[ir], [ur], [er], [or] > [ar/er], [vr/or], [ər] > 
[ig], [ug], [eg], [og] [ag], [03/23], [9] > 
ear [ia] [a/e], [0/9], [9] 

poor [pua] far [fa/fe] 

care [keg] for [fo/f5] 

four [foa] father [fad9] 


Like northern dialects, southern dialects also absorbed [a] after low-mid [a/e, v/», a]. 
Furthermore: “In Southern folk speech, /g/ is often lost, door, four, poor /dog, fog, poa/ 
thus becoming /do, fo, po/” (Kurath & McDavid 1961: 171a and map #156).! 

Kurath and McDavid devoted special attention to sandhi contexts — contexts in which 
the word-final vowel which once had /r/ is used in a phrase with a following word. 
Then the word with original /r/ can be said to have two “sandhi doublets,” depending 
on whether the second word begins with a consonant (when the original /r/ would have 
been in weak position) or with a vowel (when the original /r/ would have been prevo- 
calic). They stated: “...ear, poor, care, four have... the positional allomorphs /ig ~ ior, pua ~ 
puar, keeg (keg) ~ keear (kear), fog (fog) ~ foar (foar)/, and car, for (stressed), father the al- 
lomorphs /ka (ke) ~ kar (ker), fo (fa) ~ for (for), fad ~ fadar/” Note the difference between 
non-low vowels, in which the reflex is [Va(r)], and low vowels, in which the reflex [V(r)] 
lacks [2], since [3] had been absorbed by the preceding low vowel. The idea of calling 
these doublets (and they provide a notation for doublets) suggests a model of the lexicon 
in which a lexical item is composed of multiple subunits, which could be written as an or- 
dered pair such as {[ig]/sandhi before consonant; [igr] / sandhi before vowel}. One might, 
for example, write a doublet for the noun Cuba as pronounced by John F. Kennedy in his 
speech “in the Cuban Missile Crisis" generally as [kubo], as in and then shall Cuba[a] be 
welcomed (16:07), but [kubor] in phrases such as Soviet assistance to Cuba[r] and I quote 
(4:32) and turned Cuba[r] against your friends (15:05).? 

Examples constructed in the spirit of Kurath & McDavid (1961) are given in Table 2, 
top. 

Itis worth drawing attention to the fact that prevocalic sandhi examples with non-low 
vowels have the sequence [ar] (p. 171b); as in [igr, puar, keear (kear), foar (foar)] from the 
list above. The sandhi sequence [VarV] has in effect two segments - [2] and [r] - which 
reflect earlier /r/. Both cannot be original. There must have been an antecedent stage of 
['irV, *purV, *kærV ("kerV), "forV (“forV)] in sandhi position before a vowel. The [a] we 
see now in [iar], etc., had to have been introduced by analogy from other forms to the 
sandhi forms before vowel. 


1 The lexeme poor is treated once as having a mid vowel (171a) and otherwise as a high vowel (170b, 171b, 
172a, 172b). (One, also 171b, is ambiguous.) 
? http://www.historyplace.com/speeches/jfk-cuban.htm. 
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Table 2: Sandhi /r/ and Intrusive /r/, North 


context high, mid V low V 


sandhi ` bal [ug], [ea], [oo] [a/e], [0/9], [9] 
[V(a)tV] | ear and [igreend] far and [fareend] 
poor and [pugrend] for all [foral] 
care and [kegrend] father and [faósraend] 
four and [fograend] 


intrusive fi], [u], [e], [o] [a/e], [0/9], [9] 
[vrv] three and *[Orirend] ma and [marænd] 
two and *[turænd] law and [lorænd] 


day and *[derænd] Martha and [maðsrænd] 
know it * [norit] 


Analogy is relevant to history in another respect (Sóskuthy 2013). Not uncommonly, 
words that ended originally in low vowels without /r/ acquired a non-etymological, or 
“intrusive,” /r/ in sandhi, as in the familiar law and order [lorəndodə] and other examples 
in Table 2. As Kurath & McDavid (1961: 172a) state, 


On the analogy of such doublets as for /fo (fo) ~ for (for)/, car /ka (ke) ~ kar (ker/, 
and father /faóo ~ faóor/, positional allomorphs ending in /r/ are often created in 
Eastern New England and Metropolitan New York for words that historically end 
in the vowels /p ~ 5, a ~ g, al. as law, ma, Martha. Thus one hears law and order /lor 
and pde, lor ond oda/, ma and pa /mar ən(d) pa, me an(d) pe/, Martha and I /maQor 
(meðər) and ai/. 


The examples of intrusive /r/ just cited involved only words which end in a low vowel 
- that is, they have the same vocalism in the non-sandhi environments as words that 
originally ended in /r/ but which absorbed the [a] reflex of /r/; thus law Un (19)/ has the 
same vocalism as originally rhotic words like for /fp (fo)/. But words like three, two, day, 
know, which end in mid and high vowels, differ. Kurath & McDavid (1961: 172b) state: 


It is worth noting that after the normally upgliding free vowels /i, u, e, o/, as in 
three, two, day, know, an analogical "intrusive" /r/ never occurs. The reason for this 
is clear: since /Ori, tu, de, no/ do not end like the phrase-final /r/-less allomorphs 
of ear, poor, care, four (a. pua, keg, foa/, the basis for creating allomorphs ending 
in /r/ is lacking. 


Thus according to Kurath & McDavid (1961), the development of intrusive /r/ involves 
the comparison of stem shapes, for example [lo] with [fo], which are similar and per- 
mit analogy, as opposed to three [Ori] with [ig], which are dissimilar and do not permit 
analogy. 
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This distribution is interesting. What determines whether analogical intrusive /r/ de- 
velops is an arbitrary division of vowels inherited from the previous history of derhoti- 
cism; that is to say, a distinction in vowels involved in the earlier history of reflexes of 
/r/ in weak position continues to have an effect on later developments. Thus parole has 
the property of inertia (persistance), so that later changes (such as the analogical devel- 
opment of intrusive /r/) can be sensitive to properties of parole that persist. At the same 
time as parole is inertial and conservative, parole nevertheless carries with it the possi- 
bility of change. Thus original r-less words ending in low vowels have the potential to 
develop an intrusive sandhi /r/, as happened in northern dialects. Conversely, original 
r-full words had the potential to eliminate the second member of the “doublet” in which 
/r/ reappears in sandhi before a vowel; this is what happened in southern dialects (espe- 
cially Upper South but even in the Lower South sandhi forms with /r/ are “only half as 
frequent as the variants without /r/”, Kurath & McDavid 1961: 171b). 

Parole, then, is inertial but carries the potential for change. This example is similar 
to what Saussure said about Latin stress, that it remained on the syllable where it had 
always been - by convention, or memory, or inertia — but eventually the stress was 
repositioned. 

It might be objected that it would be easy to state a rule inserting /r/ that is sensitive 
to vowel height; insertion would happen only in position after low vowels. But why 
low vowels? Low vowels are not universally more likely than other vowels to adopt 
a phonotactic sequence [VCV] that other vowels. Intrusive /r/ develops only after low 
vowels because it is only low vowels that offered a model for analogical extension, and 
that is a distribution that goes back to a prior change; the restriction to low vowels can 
only be understood by viewing it as the hangover from a previous stage. Moreover, it is 
not just any consonant that reappears; it is just the one sound /r/. The /r/ can participate 
in "intrusive" analogy because the /r/, and only the /r/, was carried over from earlier 
history. The fact that /r/ is involved in analogy at all is a further instance of persistence 
of parole. 


3 "Watergate" and its ilk 


Against this background I want to discuss how innovations can arise directly out of 
speech. The word Watergate and its derivatives can serve as an illustration. As is familiar, 
Watergate is the name of a complex of five buildings built in Washington, D.C., over 
the period 1963-1971. An office building in this complex was used by the Democratic 
National Committee as headquarters leading up to the 1972 election. The Democratic 
offices suffered a break-in, for which staff members of the Republican administration 
were later discovered to be responsible. The break-in triggered an embarrassing scandal 
and, because of the attempt to cover up the original crime, led to the resignation of 
President Richard Nixon. 

A modification ofthe name for this location keeps being applied to more events, which, 
like the original Watergate, include at least two events, layers of agency, times, places. 
The core is the pairing of two events: first, an event carried out in secret and, second, the 
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fallout, including the embarrassment caused by the event for the participants and per- 
haps further developments (cover-up, disclosure). The whole scenario is a rich instance 
of the familiar trope of metonymy, which points to one event - here, the original trans- 
gression - which can invoke associated events (here, the fall-out) and the constituents 
of those events (locus, agents, patients). The name for this complex of events and con- 
stituents, which occurred in 1972, is of course Watergate — the name for the place is 
applied to the whole package of events, by the trope of pars (locus) pro toto (complex of 
events — crime, scandal, cover-up, further fall-out). The semantic operations involved in 
Watergate are familiar, banal tropes. 

Event complexes similar to the original Watergate scenario can be named by the new 
compound {x+gate}, where {gate} refers to the existence of a scandalous event (and fall- 
out) and x refers to a focus - a constituent that is central to the events — such as the 
agent (Billygate) or causal entity (nannygate) or the patient (contragate). 

The morphological structure and semantics of the new compound {x “focus”+gate} 
“event(s) leading to scandal” seem clear, and it seems clear that the compound is related 
to the origin Watergate. How? Given the apparent overlap of {gate} in both, one might 
imagine that the word Watergate was decomposed into two morphemes, {water} and 
{gate}, and that reanalysis provided the model for neologisms. But this cannot be: by 
itself “water” does not mean anything in this context; it is not the focus. And for that 
matter, gate doesn’t mean scandal here in the compound Watergate. In the original word 
Watergate, there is no division; Watergate is the name for the complex as a whole, not 
for any of its constituents. 

And yet Watergate was self-evidently the source for the formula {x+gate} and novel 
applications of the formula. What this means is that “Watergate” - the name for a whole 
complex of agents and events — allowed speakers to imagine a new structure {x+gate} 
whose semantics give overall semantics analogous to the meaning of Watergate (secret 
event and subsequent scandal, specific place or agents, etc.) but in which the event 
complex is broken into two constituents; one of them, {x}, refers to the focus of events, 
and the other part, {gate}, establishes the existence of a secret event and its attendant 
scandal involving the focus {x}, whereas in the source Watergate, the whole included all 
the components. 

Two aspects of this change are significant. First, the new structure is motivated by the 
inherited word, but it is not a copy; it cannot be generated by a proportional analogy. In- 
stead, what the example shows is that speech has the potential of providing motivation 
for creating new speech directly. To say it another way, speech is not just speech; speech 
invites modal possibilities. The second point is that the source here really is speech that 
actually occurred in real time: Watergate started as a single event complex that occurred 
at some time in history; it did not start as a pattern. That is, a singular event and the ac- 
companying speech give rise to an innovation; speech creates speech. This new {x+gate} 
is a virtual structure which might exist indefinitely. We cannot verify its existence until 
it is acted on. Therein is a property of language that has eluded description: the fact that 
speech happens, that activity matters, it happens when a novel formation is used, and it 
happens to the extent that neologisms are created and used in speech. 
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This example, then, suggests a more active role for parole (performance, speech) than 
has usually been assumed. In this instance actual speech from a very specific historical 
time (1972) provided the model and created the potential for new speech, and that is what 
resulted. It is worth stating that speech is not just blind activity; speech comes with 
implicit patterns, whether firmly established or - as in this case — potential, possible, 
modal speech. 

It could be mentioned that this formation, along with similar neologisms motivated 
by alcoholic, have distinctive stylistic overtones and spheres of usage - in the personal 
sphere, gentle mockery (shopaholic, chocoholic) and not-so-gentle journalistic irony for 
the former (Camillagate). The News History Gallery at the Newseum? in Washington, 
D.C., devoted to the history of journalism, has an exhibit called “The ‘Gate’ Syndrome,’ 
illustrated by five examples, starting with Koreagate (1976). * 


4 "(N)apron" as dynamic doublet 


A somewhat similar change is the change from napron to apron in Middle English. As 
is familiar, a dozen or so nouns which had once begun with an initial consonant n lost 
the n and came to begin with the vowel of the first syllable. According to the standard 
analysis, this happened because when such nouns were used with the indefinite article 
a(n), a sequence of [anV] would result, and then it is unclear whether the intervocalic 
[n] belongs to the stem of the noun or to the article. The ambiguity opened up the 
possibility that the [n] could be attributed to the article and the noun could be reanalyzed 
as beginning with a vowel. Subsequently the stem shape without the vowel could be 
extended to all contexts; thus {a+napron} > [anapron] was analyzed as {an+apron} > 
[anapron], leading to the use of {apron} elsewhere. As is well known, the converse also 
occurs, where nouns beginning with an initial vowel (an ewt) acquired an initial n from 
the indefinite article (> a newt). It is not clear why the change of metanalysis should be 
able to go in either direction. 

This standard analysis discusses only the end-points of this change - prior to met- 
analysis, after metanalysis - but does not describe how the change progressed. To get a 
sense of how this change actually proceeded, I attempted to trace the history of spellings 
(n)apron in Middle English with an eye to variation in the choice of the word form in 
different contexts. The task was rather more challenging than I had expected. The word 
(n)apron is quite specific. It occurs infrequently, primarily in wills and inventories of 
good to be bequeathed. (And also, as will be noted below, in a description of the rules 
of the household of Edward IV.) The item is mentioned only in a minority of the wills 
or inventories available, and usually when the deceased is a woman. For example, the 
extensive Wills and Inventories of Bury St. Edmunds has approximately 150 printed pages 
of wills from the beginning of the fifteenth century (one will from 1370, then 1418, etc.) 


3 http://www.newseum.org/ 

4 Arnold Zwicky calls {gate} a “libfix” - “lib” in the sense of “liberated” — which captures the idea that a 
mental operation extracts a new affix (https://arnoldzwicky.org/2010/01/23/libfixes/). The author wishes to 
thank the editors for this and many other valuable and droll comments and corrections. 
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to the late sixteenth century (1570), and has no instances of the word in either variant, 
napron or apron. That, despite instances such as the will of one Agas Herte (a. 1522, pp. 
114-18), who bequeathed about 50 distinct household objects to her son, including “ij 
tabyll clothes, vj napkyns, iiij pleyne and to of diap, a salte saler of pewter...” and about the 
same number to her daughter, including “ij tabell clothes, vi napkyns, iiij pleyn and ij of 
diap, and a pleyn towel.." Among all the items she bequeathed, including the items made 
of cloth just mentioned, no (n)apron was mentioned. This might because this household, 
and other households as well, did not use (n)aprons; it might be they were considered 
too insignificant to be mentioned in bequests (though towels and napkins and sheets are 
recorded regularly). In any event, the frequency with which (n)apron appears is modest. 
In short, it has proven difficult to find document sets in which (n)apron is mentioned 
multiple times; examples are isolated. To maximize the range of texts examined, I used 
Hathitrust/Google scans subjected to OCR. I searched for both napron and apron, both 
singular and plural, in variant spellings. 

We can first take a quick look at chronology, using a ledger (Fabric rolls) kept by the 
York Minster which recorded miscellaneous expenses annually. The entries are written 
in Latin, though names for some items specific to the contemporary realia appear in 
English. Half a dozen times the rolls record payment for the costs of masonry, both 
for wages and equipment - aprons and gloves for masons (called “setters”). The earliest 
record from 1371 surprisingly has n-less aprons (ij aprons et cirotecis 'two aprons and 
gloves', 1371). Then at the beginning of the fifteenth century come two instances of 
naprons: In remuneracione data cementariis vocatis setters ad parietes cum naprons et 
cirotecis, per annum 9s. 10d. ‘as compensation given to the masons known as setters at 
the wall with aprons and gloves, annually 9s. 10d’ (1404); In ij pellibus emptis et datis 
eisdem pro naporons, ‘two hides were bought and given to them to serve as aprons’ 
(1423). At the end of the fifteenth century there are two examples of Latin limas (duobus 
limatibus, 1497-98; Pro ij limatibus, 1499), and shortly thereafter, aprons (pro ij le 
aprons de correo pro les setters per spacium ij mensium, 12d. ‘for two aprons of hide for the 
setters for the period of two months, 12 shillings’ (1504). The use of aprons in 1371 seems 
anomalously early (could it be an error in transcribing the text?). This anomaly aside, 
the examples suggest a chronology: napron was used in the fifteenth century (1404, 1423) 
and shifted to apron the beginning of the fifteenth century (1504). Other texts suggest 
there was still some variation in the sixteenth century. By 1600 apron had taken over. 

Against the background of generally skimpy attestation of (n)apron in the fifteenth 
and sixteenth century, there are two texts which offer enough examples to allow us to 
say something about usage. One is a single text, the so-called Liber Niger Domus Regis, 
which specifies the duties and compensation of the staff of King Edward IV’s household 
in the last quarter of the fifteenth century (c. 1480). A modern edition compiles three 
manuscripts (discussion, Myers 1959: 51-60). The oldest is a manuscript from the end 
of the fifteenth century, which served as the basis for the famous 1790 publication by 
the Society of Antiquaries (abbreviated “A”); however, text A is now defective, and it 
also appears that the 1790 edition took some liberties, so the printed 1790 edition cannot 
be trusted to represented the oldest text A. In the accompanying Table 3 I’ve cited the 
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location of readings from the 1790 reading edition in ||. The next oldest is a sixteenth- 
century manuscript (preserved in the Public Records Office, the Exchequer, abbreviated 
E), from the era of Henry VIII, is similar to A but fuller. Third, the youngest of the three 
manuscripts, known as Harleian 642 (here H), is a seventeenth-century copy made by 
Sir Simonds d’Ewers. In fact, differences recorded in footnotes by Myers in his edition 
are minimal and affect the analysis here in only one respect, mentioned below. 

There are basically two contexts (with one additional outlier). Examples repeat over 
the descriptions of many different servants. The twelve examples of (n)apron are given 


in abbreviated form Table 3. 


Table 3: (n)apron in Liber Niger of Edward IV 


type text [variants] 

855 MOD they have part of the “yeftes* geuvn to the 

49.7d| houshold... but none aprons [1790: 
"gyftes"] 

$62 DO [takith] at euery of the iiij festes of the 

52.28d| yere, naprons of the great spycery 

$62 DO take naprons also at euerych of the iiij 

52.28d| festes 

$80 DO etithe in the halle; taking for wages... and 

71.6| nyst lyuereye, napors, and parte of the 
generall giftes 

$80 DO taking for his wynter clothing chaunces, 
napors, parte of the giftes generall 
[extended passage, absent in 1790] 

874 1po Eche of them takethe... j napron of lynyn 

61.26d cloth of ij ellez [1790: a naperon] 

877 lbo  Ateuery of the iiij festes, j napron ofj 

65.19d elle, price vjd. [1790: one napron of one 
elle] 

$33 PRP [he takith]... ij elles of lynen clothe for 

36.10u aprons, price the elle, xijd. 

877 PRP ij ellez of lynyn cloth... for naprons 

64.25d 

877 PRP. jelle of lynnyn clothe for naprons 

65.8d| 

§77 PRP. j elle for naprons of lynyn cloth 

64.41 

$80 pre “and for chaunces iiijs. viijd.* of napors at 

71.26d| euerye [H?*; H aprons] 
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In one isolated instance, the noun is preceded by none; the nasal might have elicited 
the following apron. The other eleven tokens are split between two contexts. In six 
instances the noun is the direct object of the verb ‘take’ (listed as ^no"). The entities 
taken have already been formed into garments; all have napron. Within this group of six 
examples, in two of these six, marked here as "po" the word napron is preceded in texts 
E and H by j, that is, Roman numeral ‘one’. (The published 1790 version has the indefinite 
article a napron in one instance and the written word one napron in the other, rather than 
the numeral.) Both E and H use numerals consistently in discussions of compensation; 
prices of elles of linen cloth are cited with numerals, such as j elle. The numeral here 
must be original, thus j napron in both examples. I will return to these in a moment. 

The second group of six examples involves the statement that servants receive compen- 
sation in the form of linen cloth which is supposed to be turned into (n)aprons, expressed 
by a preposition, usually for, once of. In this context the entity referred to as (n)apron 
does not yet exist; the noun has a future attributive sense: "the speaker wishes to assert 
something about whatever or whoever fits that description" of being (or becoming) an 
apron (Donnellan 1966: 285). An example is: at eueryche of the iiij festes of the yere, of the 
clerk of greete spycery, ij elles of lynen clothe for aprons, price the elle, xij. This one sen- 
tence has aprons, which seems to suggest that an attributive reading implies aprons. But 
this sentence is the only instance with aprons among the five tokens of this attributive 
context, so an attributive reading by itself can't explain aprons in this specific example. 
I return to this token below. 

Let us turn to the second text, namely Durham wills and inventories. The second text is 
not, strictly speaking, a single text but a series of wills; still, they are all from one locale 
and one tradition over a short interval, from 1562 to 1570, and can be treated as a single 
text. (In fact, there is a string of four tokens of (n)apron in a row.) The tokens are given 
in Table 4. 

Only two contexts occur in this small corpus. One context is represented by two 
tokens, in which the indefinite article and noun are separated by a modifier (a linn Apron, 
a blewe apron). The modifier makes these constructions novel. This pair of examples 
suggests two thoughts: that the innovative form apron is favored to the extent the context 
in which the noun is used is non-idiomatic, novel; and the concept of idiomaticity is a 
gradated (not discretely binary) parameter. To continue down this path, both in the 
example from 1562 and the 1570 example (from volume III of these documents), an apron 
occurs in the middle of a miniature list of three bequeathed items. Lists by their nature 
hint that a set of entities could be extended, so they promise a modal, possible, open- 
endedness. Thus it appears that open-endedness favors the innovative form an apron. 
In contrast, in the will of cook William Hawkesley (presented in Table 4 as a block of 
four tokens), the second through fourth tokens have a fixed phrase a napron, and the 
whole construction, ‘I give to x a napron’, is a standard idiom of bequests. Thus fixed 
idioms use the inherited older form a napron. In 1569 Alice Barnes receives two worsted 
items; possibly the parallel in material is the critical information, and the fact that one 
is a (n)apron is incidental. 
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Table 4: (n)apron in Durham Wills and Inventories 


type text 


MOD It’ I bequith to Agnes Carter a linn Apron. (1.277, 
1567) 

MoD Itm I gyve to Helenor Huntley iiij?' blake patletts iiij? 
cherches a blewe apron & ij? velvett pattletts (1.343, 
1570) 

ART to Thomas Burdon a busshell of wheat - to Jane 
Brantinga' a line kyrcheff an apron & a pair of hoose 
(1.198-99, 1562) 

ART  Igeveunto Elizabeth Hackforth a kerchif, a raill, a 
smock, an apron and all my workday rayment and in 
mony 3s. 4d. (11.56—57, 1570) 

Itm I gyve to katheryn barnes ij? vj*. I'm I gyve to 

ART  thomeis hynde y! was my p'ntice an apron & a new 

ART  fyshe knyffe. | Itm I gyve to thomas capstone a 

ART  napron.|It'mIgyve to thomas boswell a napron.| 

ART It’mI gyve to luke hanynge a napron & a fyshe borde. 
(1.327, 1570) 

ART Andto alles Barnes a gowne of worsted & a napron of 
worsted (1.305, 1569) 


r 


The most interesting example is the first example of the 1570 set. Throughout this will 
of William Hawkesley, the recipients are identified in an unambiguous but not expansive 
fashion; the recipients are listed by name alone (22 xx) or with name and geographical 
location (3 xx) or name and relationship (9 xx), such as mother-in-law or midwife. That 
is, the recipients are presumed to be known by name with the briefest of descriptions, 
almost titles. Against that background, the description of the first recipient of aprons, 
thomeis hynde y! was my p’ntice, stands out; given its relative clause vd (‘that’), the iden- 
tification of Thomas is relatively elaborate. Indirectly, this means that the bequest - an 
apron and a fish knife - is out of the ordinary, atypical. In contrast, in the three be- 
quests that follow immediately thereafter are idiomatic. It appears, then, that novel or 
unexpected bequests of aprons - the bequest itself or the recipient — are expressed by 
the innovative (an) apron, while less novel scenarios are expressed by the older form a 
napron. 

This takes us back to the one example in the Liber Niger Domus Regis Edward IV 
which had aprons (other than none aprons): [he taketh] at eueryche of the iiij festes of the 
yere, of the clerk of greete spycery, ij elles of lynen clothe for aprons, price the elle, xij*. In 
and of itself, the sentence is unremarkable and indistinguishable from the other examples 
with prepositions which had naprons. What might be atypical is the office described here, 
which is that of sewar, the highest ranking and first mentioned of the king’s servants: A 
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SEWAR FOR THE KYNG, wich owith to be full cunyng, diligent, and attendaunt. He receueth 
the metes by sayez and saufly so conueyeth hit to the kinges bourde with saucez according 
therto, and all that commith to that bourde he settith and dyrectith (§33, p. 112). In this 
instance, although the act of taking aprons is not exceptional, the recipient - the sewar 
- is unique. This is then similar to thomeis hynde y! was my p'ntice from Durham wills 
and inventories, in the sense that the non-idiomatic character of the example derives 
from the recipient, not the (n)apron phrase. That should not be surprising, since the 
act of bequeathing includes a recipient as well as the item bequeathed. The innovative 
aprons here acts effectively as an honorific to draw attention to the unusual status of the 
recipient, as it did with thomeis hynde y! was my p’ntice. 

In general, it appears the innovative form is favored if the transfer of apron is novel, 
not typical, and this extends to the recipient of the transfer (relative to other recipients). 
This principle applies to both example sets from different stages of the change. This 
distribution - unidiomatic context prefers the novel form - turns out to match other 
instances of the competition between equivalent morphological forms. Thus in contem- 
porary Czech the locative singular (used with certain prepositions) can be either the 
traditional ending {-e} or a new ending {-u}. (That ending is original with nouns of the 
Indo-European u-stem declension, but its use with o-stem masculine nouns is new.) The 
parallel is that the traditional {-e} is used with "typical" combinations while innovative 
{-u} is used with atypical contexts (Bermel 1993). 

There is another regularity of some interest that applies to both texts. We saw above 
that in Liber Niger there were two instances in which (n)apron followed the numeral 
j Cone’), and in both the older form napron was used. The examples are more or less 
equivalent in meaning to a true indefinite article as in a napron; the two examples of j 
napron (with napron) with the numeral invite the suspicion that true indefinite articles 
at this time might have napron, if they were attested. Conveniently, there is a contempo- 
raneous will that has two tokens of an indefinite article one after the other: Also I gyve 
to Margarete Holton my best kyrtill & a napron. Also I gyve to Elisabeth Wike a smok 
& a napron (will of Jone Montor, 1489, Surrey Wills, p. 95). A slightly later example is 
consistent: A jak & a salet, a gorget, ij gussettes, a napron, and iij gauntlettes (York wills, 
p. 35, 1512). These examples at least suggest that the context with an indefinite article 
used the more conservative form. 

To return to the other text under discussion here, Durham wills and inventories, there 
were two recognizable contexts. One involves an indefinite article split from the noun; 
it does show that the change had progressed to novel (unidiomatic) contexts. The other 
context had seven tokens with an indefinite article (not separated from the noun); the 
older form napron was used 4 times, the novel form time 3 times. That is to say, the novel 
form apron was slow to appear in the context in which the indefinite article immediately 
preceded the noun. For both periods (late 1400s, third quarter of 1500s) it appears that a 
construction with an indefinite article uses apron less (or at least not more) than other 
contexts. 

Now the standard analysis is that the ambiguous combination of indefinite article and 
napron led to a reanalysis of {a+napron} to {an+apron}. If so, it would be natural to expect 
that apron would be used first in the context of reanalysis and only later in other contexts 
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- that is, it should appear earliest with the indefinite article. But we just saw that in both 
texts, apron was used in other contexts when an apron was not yet used (the first text 
plus the auxiliary wills) or not used as frequently (the second text). 

This suggests a revision of the account of reanalysis. Since the appearance of apron 
is in fact not tied to the indefinite article, the unit apron appears to have some degree 
of autonomy. The reanalysis consists not of replacing the underlying shape of the noun, 
but it consists of imagining the possibility of an alternate word form {apron}, which co- 
exists, for a time, with an alternate sublexeme {napron}. Imagined {apron} becomes real 
only when it is actually used. Following the general principle that innovative forms ap- 
pear first in novel contexts, {napron} was maintained with the indefinite article — in fact, 
the most conventional and idiomatized construction - while the sublexeme {apron} was 
used in novel contexts cited above, such as [takith]... ij elles of lynen clothe for aprons 
and ... to thomeis hynde y! was my p’ntice an apron. Over time, {apron} and {napron} 
compete; {apron} keeps on increasing, in a fashion that could be understood as the other 
half of Newton’s first law: once in motion, a body, or linguistic subsystem, will remain 
in motion. 

Semantically the new demilexeme {apron} must be basically similar to traditional 
{napron}. For example, both demilexemes refer to protective coverings, usually of cloth, 
though in artisanry, aprons could be sheepskins. In the York fabric rolls - the record of 
expenses of the York expenses, including irregular expenses of masons and their equip- 
ment — we observe naprons used at the beginning of the fifteenth century - In ij pellibus 
emptis et datis eisdem pro naporons 'two hides were bought and given to them to serve 
as aprons' (1423) - and then the form is aprons at the end of the fifteenth century]: pro 
ij le aprons de correo pro les setters per spacium ij mensium, 12d. ‘for two aprons of hide 
for the setters for the period of two months, 12 shillings' (1504). That is only to say that 
napron and apron seem to have the same extension. 

Still, despite the overlap in extension, there are indications that the two sublexemes 
began to develop slightly different connotations.” Two facts argue for this. 

The first, perhaps unexpectedly, has to do with translations of the Bible. As is well- 
known, John Wyfcliffe translated much of the Bible from the Vulgate, around 1382. (His 
translation was finished by his followers after his death.) A passage of interest is Genesis 
3:7 - the famous story of the nakedness of Adam and Eve - for which Wycliffe (or his 
followers) translated Vulgate ...cognovissent esse se nudos consuerunt folia ficus et fecerunt 
sibi perizomata as ...and when they knew that they were naked, they sewed the leaves of a 
fig tree, and made breeches to themselves. 

Wycliffe's Bible became the model for an extended tradition of English translations 
thereafter, but with a difference in this passage. Starting with Tynsdale (1534), the sub- 
sequent translations have a different noun in Genesis 3:7: ...vnderstode how that they 
were naked. Than they sowed fygge leves togedder and made them apurns. The transla- 
tion with aprons continues through Cloverdale (1535), the Great Bible (1540), Matthew's 
Bible (1549), the Catholic Bishops' Bible (1568), the Geneva Bible (1587), and finally the 
King James (1604-1611). All have aprons (variant spellings) except for the Geneva Bible, 
a retrograde Protestant Bible which returned to Wycliffe's breeches. 


5 In a fashion consistent with Bréal’s (1900: ch. 2) “law of differentiation" of synonyms. 
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The improvised fig-leaf garment of Genesis 3:7 wasn’t exactly an apron in the sense 
of linen or hide aprons, but it was somewhat similar. Why was apron used instead of 
napron? One reason might be that apron was the innovative form, and innovative forms 
are more appropriate than conventional forms for encoding semantic extensions. There 
is another possibility. As we saw in the Liber Niger, naprons were something that would 
result from linen, and their value was defined by the price of the linen used to make them. 
In earlier wills naprons were classified with other items of cloth with different functions; 
naprons belonged to the naperie (the collection of similar cloths) along with napkins 
(same sense as modern) and borde clothes or table cloths. So the sublexeme {napron} em- 
phasized the origin of the entity in cloth or hide; secondarily, such a flat piece of material 
could be donned for protection. With the sublexeme {apron} the dominant feature is not 
that it was made from material (or hide); the dominant feature is that it is a garment worn 
to provide protective covering. This difference in the ranking of features - MATERIAL as 
opposed FUNCTION - might be why the corrections to Wycliff's Bible used apron. 

A second indication is the way items are grouped in Chesterfield wills and inventories 
of household goods. For example, from Derbyshire #177 (Margaret Capper, 1588) here is 
a partial list of items (omitting tools and animals). Items are listed in the inventory in 
natural classes. Categories are added here: 


«furniture» / 4 bedstids in the Chamber / 2 bedstids in the parlar with pented 
Cloates about them / 1 bed teaster of Cloathe / 3 bed stids in the nether Cham- 
ber / «bedding» / 8 pillobears / 8 hand towels / 4 shietes / «garment» / 1 smock / 2 
aperns / 1 bruse and a grater / 1 Coat and a pear of house / 1 Gone and a for kertle 
[?] / 1 buckrame savgard / «utensils» / 2 Chamber pots / 1 morter and a Cresset / 
3 Chaffindishes / 1 Skomar and a ladle of brase / 9 bear potes and 2 black potes / 1 
falling bord in the house / 3 pans 2 ketles / 1 basson brase / 4 brase potes 


Note that aprons is listed next to smock and other garments. (The listing of a “bruse 
and grater” below aperns seems out of place.) By this time, in the late sixteenth century, 
an {apron} was classified as a garment. 

Does this explain why {apron} continued to displace {napron}? Possibly. The demilex- 
eme {apron} removes aprons from the domain of the naperie (the collection of pieces of 
fabric) to the domain of garments. The extension may be the same, but the intension 
changes, by the re-ranking of the semantic features of the two demilexemes: {napron} 
ranks the material over the function, whereas {apron}, while it does not that fabric may 
be involved, ranks garment and its function of covering as more important. 

There are several conclusions here. The two demilexemes have a certain autonomy: 
they have overlapping but not identical semantics; they have different preferred contexts 
in which they appear. From this it follows that the change of napron to apron is not a 
simple substitution of one form for the other. Next, the newer form apron seems not 
to appear in the context of an indefinite article ahead of other contexts, as might be 
expected if apron merely replaced napron. This again implies that apron and napron are 
somewhat separate entities. Third, the ambiguity of [anapron] made the change possible, 
but the change was the creation of two demilexemes here, {napron} and {apron}, not a 
reparsing. 
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5 Conclusion 


The examples above suggest that parole exists, that it has a role in language. Saussure, 
as we saw, did his best to hide parole from view, but it ended up that parole has a life 
of its own: it required its own set of rules and it was maintained (the accent stayed 
on the same syllable as in Latin) without justification in the system.® All the action of 
stress in Romance was in parole, not systéme. In other examples, we saw that parole 
is maintained from one generation to the next, not because it is motivated by higher 
principles, but because it was the usage and it was then transmitted as usage. Parole 
can also shape other changes (such as the apocope of post-tonic vowels in the transition 
from Latin to French and the restriction on intrusive /r/ to low vowels). Variants of 
lexemes, such as {napron} and {apron}, have partially separate existences and properties, 
including semantics. 

Parole is not always static and it is not one-dimensional. Parole is, after all, activity, 
and human activity implies the possibility of more activity and other paths of activity, 
which may differ from inherited activity. Parole is habit infused with potential. 


é Boris Gasparov (2013) has argued that Saussure’s thinking was more complex (and less rigidly categorial 
and structuralist) than the subsequent reception would have it (especially chapter 4, pp. 111-37). 
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On looking into words 
(and beyond) 


While linguistic theory is in continual flux as progress is made in our ability 
to understand the structure and function of language, one constant has always 
been the central role of the word. On Looking into Words is a wide-ranging vol- 
ume spanning current research into word-based morphology, morphosyntax, the 
phonology-morphology interface, and related areas of theoretical and empirical 
linguistics. The 26 papers that constitute this volume extend morphological and 
grammatical theory to signed as well as spoken language, to diachronic as well 
as synchronic evidence, and to birdsong as well as human language. 


